A COMPARATIVE EXPERIMENTAL STUDY OF THE PUPIL-SELF-STUDY METHOD
AND THE MODERN-SYSTEMATIC METHOD OF TEACHING SPELLING*

C. W. DuPEE
State Teachers College
East Stroudsburg, Pennsylvania


THE PROBLEM


The purpose of this study was to compare
by means of experimentation the pupil-self-study
method and the modern-systematic
method of teaching spelling with regard to:
(1) the amount of learning achieved; (2)
the rate at which learning is achieved; (3)
the extent to which learning acquired by each
of the two methods is retained; and (4) the
use pupils make of the time saved through
exemptions.


SOURCES OF THE DATA


Third, fourth, fifth, and sixth grade pupils
and teachers in five different school systems
in the northeastern part of the state of Pennsylvania
furnished the data. The names and
the locations of the school systems that participated
are as follows:

Bangor Public Schools, Bangor, Pa.

East Stroudsburg Public Schools, East Stroudsburg, Pa.

Pen Argyl Public Schools, Pen Argyl, Pa.

Stroudsburg Public Schools, Stroudsburg, Pa.

Training School, State Teachers College, East Stroudsburg, Pa.


DISTRIBUTION OF PUPILS AND TEACHERS


The distribution of the pupils and the
teachers in the various grades is revealed in
Table I. The table is read: The total number
of third grade pupils participating in the
experiment was 305; the total number of
third grade teachers participating in the experiment
was 9. The data for grades four,
five, and six are read in the same manner
as the data for grade three.


* This report is based on a dissertation which the writer submitted
in partial fulfillment of the requirements for the degree
of Doctor of Philosophy in the School of Education of
New York University.


TABLE I
DISTRIBUTION OF PUPILS AND TEACHERS BY GRADES

          Total Number     Total Number
          of Pupils        of Teachers
Grade     Participating    Participating
3              305               9
4              346              10
5              299               9
6              298               8
Total         1248              36


DISTRIBUTION OF PUPILS BY CLASSES 


The pupil distribution in the various classes 
in a given grade is indicated in Table II. The 
table reads as follows: The number of pu- 
pils in the third grade classes was 46, 39, 38, 
37, 34, and so on. The mean number of 
pupils in the third grade classes was 33.9. 
The data for the remaining grades are read 
in the same manner as the data for grade 
three. 


TABLE II
DISTRIBUTION OF PUPILS BY CLASSES

Grade    Enrollment by Classes            Mean
3        46 39 38 37 34 31 29 28 23       33.9
4        40 40 40 39 38 35 30 28 28 28    34.6
5        40 40 36 35 33 32 30 28 25       33.2
6        47 44 39 37 35 34 34 28          37.2





Table II shows that the classes were large 
rather than small and were therefore typical 
of most classes in city public schools. Hence 
the results are practical and not theoretical. 
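
The class means reported in Table II can be rechecked directly from the enrollments. The following is a minimal sketch in Python; the dictionary and function names are illustrative assumptions and are not part of the original study.

```python
from statistics import mean

# Enrollment by classes, copied from Table II.
enrollment_by_grade = {
    3: [46, 39, 38, 37, 34, 31, 29, 28, 23],
    4: [40, 40, 40, 39, 38, 35, 30, 28, 28, 28],
    5: [40, 40, 36, 35, 33, 32, 30, 28, 25],
    6: [47, 44, 39, 37, 35, 34, 34, 28],
}


def class_summary(enrollments):
    """Return (number of classes, total pupils, mean class size)."""
    return len(enrollments), sum(enrollments), mean(enrollments)


for grade, sizes in enrollment_by_grade.items():
    n_classes, total, average = class_summary(sizes)
    print(f"Grade {grade}: {n_classes} classes, {total} pupils, mean {average:.2f}")
```

The totals agree with Table I (305, 346, 299, and 298 pupils). The sixth grade mean computes to 37.25, which the table prints as 37.2.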


QUALIFICATIONS OF TEACHERS 


Table III records the amount of training 
of the teachers in the various grades. The 
table shows that all of the teachers had two 
or more years of training. Twenty-five per 
cent of the teachers participating had a bachelor’s
or a master’s degree. Many of the
teachers listed as graduates of a two-year
normal course had earned additional credits
toward the bachelor’s degree. The teachers
were, according to the teacher training standards
of Pennsylvania, well trained.


TABLE III
AMOUNT OF TRAINING OF TEACHERS

          Graduate,
          2-Year                                    Total
          Normal       Bachelor’s    Master’s       No. of
Grade     Course       Degree        Degree*        Teachers
3            7             1             1              9
4            8             1             1             10
5            6             2             1              9
6            6             1             1              8
Total       27             5             4             36


The teachers’ teaching experience is indi- 
cated in Table IV. The table shows that the 
teachers as a group were mature. The mean 
number of years of teaching experience was 
14.97, or approximately 15 years. None of
the teachers was inexperienced or teaching 
for the first time. Only one of the thirty- 
six teachers included in the study was in her 
second year of teaching. Only six teachers 
or 16 2/3 per cent of the total number of 
teachers had taught less than five years. 
Seven of the teachers, or approximately 20 
per cent of the entire group, had taught for 
twenty-five or more years. From the stand- 
point of experience, the teachers were well 
able to teach spelling. 


TABLE IV
TEACHING EXPERIENCE OF THE TEACHERS

         Years   Years   Years   Years   Years     Years     Years
Grade    0-.9    1-1.9   2-4.9   5-9.9   10-14.9   15-24.9   25-40
3        2 3 2 2
4        3 3 2 2
5        1 1 2 4 1
6        3 1 2 2
Mean     14.97


EQUATING PUPILS AND TEACHERS 


The alternate method was used in equating 
such variable factors as initiative, mental ca- 
pacity, attitudes, home environment, etc., on 
the part of both pupils and teachers. That 
is to say, a teacher and his or her pupils pro- 
ceeded by the pupil-self-study method for a 
period of twenty days; during the next twen- 
ty-day period said teacher and pupils alter- 
nated methods and proceeded by the modern- 
systematic method. 


* The teachers with the master’s degree were training teach- 
ers in the Training School at the State Teachers College at 
East Stroudsburg, Pa. A very large portion of the teaching 
done in their grades was done by student-teachers under the 
direction of the training teachers.

As a precaution against any advantages one
of the methods might have over the other be- 
cause of initial use, approximately one half of 
the teachers in each of the four grades par- 
ticipating, together with their pupils, used the 
pupil-self-study method as the initial or be- 
ginning method. The remaining teachers and 
pupils used the modern-systematic method as 
the initial method. Thereafter the groups al- 
ternated methods with each successive twen- 
ty-day period. 


PROCEDURE 
General 

1. A meeting of the teachers concerned was 
held in each of the school systems participat- 
ing. At these meetings, the purpose of the 
experiment, the experimental procedure, and 
other items of interest or importance were 
explained. By the time the meetings were 
concluded, the teachers in each grade had 
divided themselves into two groups in such 
a manner that approximately one half of the 
total number of pupils in a given grade could 
use the pupil-self-study method as the initial 
or beginning method, while the other half of 
the grade used the modern-systematic method 
as the initial method. 

2. The experiment continued for four learn- 
ing periods of twenty days each. Each 
teacher alternated the method of teaching 
with each successive learning period. For example,
a teacher who taught by the modern-systematic
method during the first twenty-day
period taught these same pupils by the pupil- 
self-study method during the second twenty- 
day period, by the modern-systematic method 
again during the third period, and by the pu- 
pil-self-study method during the fourth 
period. Teachers using the pupil-self-study 
method as the initial method alternated in like 
manner. 

3. The time allotted to spelling on the daily 
class schedule was twenty minutes per day. 

4. The one hundred words¹ or the subject
matter taught in a given grade during a learn- 
ing period was the same for both groups re- 
gardless of the method of teaching or learn- 
ing used. 
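
The alternation described in items 1 and 2 above can be sketched as a small schedule generator. This is an illustration only; the function and constant names are assumptions introduced here.

```python
# The two methods compared in the experiment.
METHODS = ("pupil-self-study", "modern-systematic")


def method_schedule(initial_method, n_periods=4):
    """List the method used in each twenty-day period, alternating from the initial one."""
    start = METHODS.index(initial_method)
    return [METHODS[(start + period) % 2] for period in range(n_periods)]


# Roughly half of each grade began with each method.
for initial in METHODS:
    print(f"Group starting with {initial}: {method_schedule(initial)}")
```

Because the two groups begin with different methods, every learning period has both methods in use, so neither method gains from being taught first.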


Pupil-Self-Study Method
1. Teachers using the pupil-self-study
method proceeded on the first day of a twenty-day
learning period as follows: (1) gave
a pre-study test (written form) on the one
hundred words in the contract for the twenty-day
period; (2) collected test papers; (3)
supplied each pupil with a printed copy of the
one hundred words in the contract for that
period; (4) advised the pupils that they were
to do their own studying when the pupil-self-study
method was used, that they would be
tested daily (written form) on the one hundred
words in the contract, and that a pupil
would be exempt from spelling, except for the
final test, when he or she made a perfect score
on two successive tests.


¹ All words were graded and arranged according to Henry
Carr Pearson and Henry Suzzallo, Essentials of Spelling, pp.
21-34, 46-62, 77-93, 109-123. New York: American Book
Co., 1919.

2. From the second to the nineteenth day 
of the learning period, inclusive, the teachers 
used the twenty minutes allotted to spelling 
on the daily class schedule for the purpose of 
giving the pupils who were not exempt a writ- 
ten test on the one hundred words in the con- 
tract for that period. 

3. On the twentieth or last day of the con- 
tract period, the teachers gave a final test, 
written form, on the one hundred words stud- 
ied during the twenty-day period. All of 
the pupils, exempt as well as non-exempt, 
were required to take the final test. 

4. When a pupil became exempt, he was 
asked to note in diary fashion the use or uses 
he made of the spelling period the days on 
which he was exempt. 

5. All test papers were scored and returned 
to their owners at the earliest possible con- 
venience. 

6. A chart was posted on the bulletin board 
at the beginning of each twenty-day learn- 
ing period. The number of words misspelled 
or the learning status of each pupil was re- 
corded on the chart daily. 

The primary duties of the teacher when the 
pupil-self-study method was used were group 
testing, analyzing results, diagnosing individ- 
ual cases, and causing or allowing each pupil 
to work up to capacity. In other words, the 
pupil-self-study method made allowances and 
provisions for individual differences. Teach- 
ers were requested to refrain from using mass 
instruction when the pupil-self-study method 
was employed. 
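
The exemption rule (a perfect score on two successive tests) can be sketched as follows. Two points are assumptions not settled by the text: only the daily tests from the second day onward are counted, and the exemption is taken to begin on the day after the second perfect score. The function names are likewise illustrative.

```python
def first_exempt_day(daily_errors, first_test_day=2):
    """Return the first day on which the pupil is exempt, or None.

    daily_errors[i] is the number of words misspelled on the daily test
    given on day first_test_day + i.
    """
    for i in range(1, len(daily_errors)):
        if daily_errors[i - 1] == 0 and daily_errors[i] == 0:
            # The second perfect score falls on day first_test_day + i;
            # exemption is assumed to start the following day.
            return first_test_day + i + 1
    return None


def periods_saved(daily_errors, first_test_day=2, last_test_day=19):
    """Count the daily spelling periods saved before the final test on day 20."""
    day = first_exempt_day(daily_errors, first_test_day)
    if day is None or day > last_test_day:
        return 0
    return last_test_day - day + 1


# Hypothetical pupil: misses 12 and 5 words, then spells every word twice in a row.
errors = [12, 5, 0, 0]
print(first_exempt_day(errors), periods_saved(errors))
```

The final test on the twentieth day is excluded from the count because exempt pupils were still required to take it.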


Modern-Systematic Method 

1. The twenty-day learning period was di- 
vided into four weeks or periods of five days 
each. The one hundred word contract for 
that twenty-day period was likewise divided 
into four units of twenty-five words each.
The first twenty-five words in the contract
comprised the subject matter to be taught
during the first week of the contract period; 
the second twenty-five words comprised the 
subject matter for the second week; and so 
on for the third and fourth weeks. 

2. The teachers proceeded on the first day 
of the twenty-day period as follows: (1) 
gave a pre-study test*, written form, on the 
one hundred words in the contract for the 
twenty-day period; (2) collected test papers; 
(3) gave pupils printed copies of the first 
twenty-five words in the contract for the pe- 
riod; (4) informed the pupils that they would 
all study spelling together during the regular 
spelling period each day; none would be ex- 
empt from spelling; and, spelling could be 
studied at home or outside of the regular 
spelling period if the pupils so desired. 

3. The general procedure on the first and 
the last day of the twenty-day period differed 
from the corresponding weekly procedure to 
the extent that the pre-study test and the final 
or after-study test given on these days in- 
cluded the entire list of one hundred words 
for that period, whereas the weekly tests in- 
cluded only the twenty-five words for the 
week. (The reasons for this difference are
obvious.)

4. The duties of the teacher on a given day 
of the week, except for the first and the last 
day of the twenty-day period, were: 

Monday, give a pre-study test on the twen- 
ty-five words to be taught during that week; 
use the balance of the period for the purpose 
of teaching the pupils the correct spelling of 
some of the twenty-five words in the week’s 
assignment. 

Tuesday, use the twenty minutes allotted 
to spelling on that day for teaching purposes. 
(With the modern-systematic method, each 
teacher was free at all times to use his fa- 
vorite or most effective technique for teach- 
ing spelling.) 

Wednesday, proceed in the same manner 
as on Tuesday. 

Thursday, give an after-study test on the 
twenty-five words taught during the week; 
use the remainder of the period for review- 
ing or re-teaching purposes. 

Friday, conduct a review lesson with the
class as a whole for the purpose of fixing in
the pupils’ minds the troublesome or unlearned
words in that portion of the contract
studied to date.


* These tests were scored but, for obvious reasons, the
papers were not returned to the pupils until the end of the
twenty-day period.

5. On the twentieth or last day of the con- 
tract period, a final test was given (written 
form) on the one hundred words in the con- 
tract for that period. 

6. The teachers recorded the results of the 
pre-study tests on Monday and the after- 
study tests on Thursday of each week on a 
chart posted on the bulletin board for that 
purpose. 

With the modern-systematic method, the 
primary duty of the teacher was methodically 
directing the learning of the pupils en masse. 
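
The division of a one hundred word contract into four weekly units can be sketched as below. The placeholder words stand in for the graded word list, which is not reproduced in this report; the function name is an assumption.

```python
def weekly_units(contract_words, n_weeks=4):
    """Split a contract into equal consecutive units, one unit per week."""
    if len(contract_words) % n_weeks:
        raise ValueError("contract length must divide evenly among the weeks")
    size = len(contract_words) // n_weeks
    return [contract_words[i * size:(i + 1) * size] for i in range(n_weeks)]


# Placeholder contract of one hundred words (hypothetical names).
contract = [f"word{n:03d}" for n in range(1, 101)]
units = weekly_units(contract)
for week, unit in enumerate(units, start=1):
    print(f"Week {week}: {len(unit)} words, {unit[0]} to {unit[-1]}")
```

The weekly tests on Monday and Thursday draw on one unit only, whereas the tests on the first and last day of the period cover the whole contract.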


Retention 

Another procedure was necessary for col- 
lecting the data on retention. One month 
after the date of the final tests on Contract 
IV, a retention test was given to each of the 
four grades participating. Two hundred 
words chosen alternately, fifty words from 
each of the four contracts studied or one half 
of the total number of words taught in any 
one grade, comprised the retention test for a 
given grade. 

Scoring the retention tests and tabulating 
the results concluded the data-collecting part 
of the experiment. 
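
The make-up of the retention test can be sketched as follows, reading "chosen alternately" as taking every other word from each contract. That reading, the starting position, and the placeholder word names are assumptions.

```python
def retention_test_words(contracts, start=0):
    """Take every second word from each contract, beginning at index `start`."""
    selected = []
    for words in contracts:
        selected.extend(words[start::2])
    return selected


# Four hypothetical contracts of one hundred placeholder words each.
contracts = [[f"c{c}_word{n:03d}" for n in range(1, 101)] for c in range(1, 5)]
test_words = retention_test_words(contracts)
print(len(test_words), "words on the retention test")
```

Under this reading each contract contributes fifty words, or one half of the four hundred words taught in a grade.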


SUMMARY OF FINDINGS 


The findings of the study may be summa- 
rized as follows: 


1. The mean number of words learned per 
pupil per contract period by each of the four 
grades participating was greater when the pu- 
pils studied by the pupil-self-study method 
than when they studied by the modern-sys- 
tematic method. When the pupils studied by 
the pupil-self-study method the least mean 
number of words learned per pupil per con- 
tract period, namely, 35.47 words, was 
learned by the fifth grade pupils; the greatest 
mean number of words learned per pupil per 
contract period, namely, 39.79 words, was 
learned by the third grade pupils. 

When the pupils studied by the modern- 
systematic method the least mean number of 
words learned per pupil per contract period, 
namely, 28.26 words, was learned by the 
sixth grade pupils; the greatest mean number 
of words learned per pupil per contract pe- 
riod, namely, 33.06 words, was learned by the 
third grade pupils.

The mean number of words learned per
pupil per contract period was as follows:

            Pupil-Self-Study    Modern-Systematic
            Method              Method
Grade 3     39.79 words         33.06 words
Grade 4     35.64 words         28.59 words
Grade 5     35.47 words         28.68 words
Grade 6     36.31 words         28.26 words


2. The difference in means or the extent 
to which the mean number of words learned 
per pupil per contract period by the pupil- 
self-study method exceeded the mean number 
of words learned per pupil per contract period 
by the modern-systematic method ranged 
from 6.73 words or 20.36 per cent per pupil 
per contract period for the third grade pupils 
to 8.05 words or 28.49 per cent per pupil per 
contract period for the sixth grade pupils. 

The difference in the means for the various 
grades was as follows: 


Grade 3 ........ 6.73 words or 20.36 per cent
Grade 4 ........ 7.05 words or 24.66 per cent
Grade 5 ........ 6.79 words or 23.68 per cent
Grade 6 ........ 8.05 words or 28.49 per cent


3. The mean rate of learning per pupil 
twenty-minute period of school time devoted 
to spelling was greater for each of the four 
grades participating when the pupils studied 
by the pupil-self-study method than it was 
when they studied by the modern-systematic 
method. When the pupils studied by the pu- 
pil-self-study method, the least mean num- 
ber of words learned per pupil twenty-minute 
period of school time devoted to spelling, 
namely, 2.391 words, was credited to the 
fourth grade pupils; the highest mean rate for 
the pupil-self-study method, namely, 2.817 
words, was credited to the sixth grade pupils. 

When the pupils studied by the modern- 
systematic method, the least mean number of 
words learned per pupil twenty-minute period 
of school time devoted to spelling, namely, 
1.395 words, was credited to the sixth grade 
pupils; the highest mean rate for the modern- 
systematic method, namely, 1.630 words, was 
credited to the third grade pupils. 

The difference in the means for the various 
grades ranged from .896 of a word or 54.97 
per cent for the third grade pupils to 1.422 
words or 101.94 per cent for the sixth grade 
pupils. 

The mean rate of learning per pupil twen- 
ty-minute period of school time devoted to 
spelling was as follows:

            Pupil-Self-Study    Modern-Systematic
            Method              Method
Grade 3     2.526 words         1.630 words
Grade 4     2.391 words         1.396 words
Grade 5     2.622 words         1.412 words
Grade 6     2.817 words         1.395 words


4. In grades three, four, and six, the mean
per cent of learning retained per pupil per 
contract period was somewhat greater when 
the pupils studied by the modern-systematic 
method than it was when they studied by the 
pupil-self-study method. In the fifth grade 
the order was reversed. 

The mean per cent of learning retained per 
pupil per contract period when the pupils 
studied by the pupil-self-study method ranged 
from 72.82 per cent for the fifth grade pupils 
to 92.41 per cent for the third grade pupils; 
when the pupils studied by the modern-sys- 
tematic method the range was from 70.64 per 
cent for the fifth grade pupils to 97.43 per 
cent for the third grade pupils. 

The third grade pupils showed the greatest 
difference, namely, 5.02 per cent, in the mean 
per cent of learning retained; the fourth grade 
pupils showed the least difference, namely, .59 
of one per cent, in the mean per cent of learn- 
ing retained. 

The mean per cent of learning retained by 
each of the four grades participating was as 
follows: 


            Pupil-Self-Study    Modern-Systematic
            Method              Method
Grade 3     92.41 per cent      97.43 per cent
Grade 4     84.40 per cent      84.99 per cent
Grade 5     72.82 per cent      70.64 per cent
Grade 6     75.35 per cent      78.31 per cent


5. The mean number of words forgotten* 
per pupil per contract period was somewhat 
greater for each of the four grades participat- 
ing when the pupils studied by the pupil-self- 
study method than when they studied by the 
modern-systematic method. The difference
in the means for a given grade ranged from
1.14 words for the fifth grade pupils to 2.27
words for the sixth grade pupils.

The mean number of words forgotten per 
pupil per contract period for each of the two 
methods of study employed was as follows: 


            Pupil-Self-Study    Modern-Systematic
            Method              Method
Grade 3     3.34 words          1.49 words
Grade 4     5.74 words          4.02 words
Grade 5     9.41 words          8.27 words
Grade 6     8.22 words          5.95 words


* The reader will please note that the terms “mean number
of words forgotten” and “mean per cent of learning retained”
are not synonymous in meaning.

6. The total number of twenty-minute
spelling periods saved through exemptions 
when the pupils studied by the pupil-self- 
study method was 11,845. In other words, 
during the half of the experiment in which 
the pupils studied by the pupil-self-study 
method, the pupils saved a total of 3,948 and 
1/3 clock hours of school time. 

The time saved through exemptions was 
equal to 29.20 per cent of the total time 
allotted to the pupil-self-study method. 

The total number of twenty-minute spell- 
ing periods the pupils in a given grade were 
exempt when they studied by the pupil-self- 
study method was as follows: 


Grade 3 ........ [illegible]
Grade 4 ........ [illegible]
Grade 5 ........ 3348
Grade 6 ........ 3613


7. In keeping with tradition and common 
practice, the pupils were not permitted to save 
time through exemptions when they studied 
by the modern-systematic method. 

8. Three items, namely, “did home work”,
“read a library book”, and “drew pictures”,
included 92.43 per cent of the total number
of entries made in the diaries which the pupils
kept when they were exempt. “Did home
work”, meaning, prepared next day’s assignment
in arithmetic, geography, etc., included
38.84 per cent of the total number of entries;
“read a library book” included another 38.46
per cent; and, “drew pictures” included another
15.13 per cent. The remaining 7.57
per cent of the total number of entries were 
sparsely distributed over a large number of 
items. 
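
Several of the figures in findings 2, 6, and 8 follow from one another by simple arithmetic, and a short script can recheck them. This is a sketch only; the constant and function names are assumptions, while the numbers are copied from the findings above.

```python
from fractions import Fraction

# Finding 1: modern-systematic means; finding 2: gains for the pupil-self-study method.
MODERN_MEAN_WORDS = {3: 33.06, 4: 28.59, 5: 28.68, 6: 28.26}
GAIN_IN_WORDS = {3: 6.73, 4: 7.05, 5: 6.79, 6: 8.05}

# Finding 6: spelling periods saved through exemptions.
PERIODS_SAVED = 11_845
MINUTES_PER_PERIOD = 20

# Finding 8: the three leading diary items, in per cent of all entries.
DIARY_SHARES = {
    "did home work": 38.84,
    "read a library book": 38.46,
    "drew pictures": 15.13,
}


def percent_gain(gain, baseline):
    """Express a gain in words as a percentage of the baseline mean."""
    return 100 * gain / baseline


def hours_saved(periods, minutes_per_period=MINUTES_PER_PERIOD):
    """Convert saved spelling periods to clock hours as an exact fraction."""
    return Fraction(periods * minutes_per_period, 60)


for grade in sorted(MODERN_MEAN_WORDS):
    pct = percent_gain(GAIN_IN_WORDS[grade], MODERN_MEAN_WORDS[grade])
    print(f"Grade {grade}: gain of {pct:.2f} per cent")

whole, remainder = divmod(hours_saved(PERIODS_SAVED), 1)
print(f"Clock hours saved: {whole} and {remainder}")
print(f"Three leading diary items: {sum(DIARY_SHARES.values()):.2f} per cent")
```

The percentages are taken on the modern-systematic mean as the base, which is the choice that reproduces the figures reported in finding 2.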


CONCLUSIONS 


The conclusions or answers to the prob- 
lems proposed at the beginning of the study 
may, in accordance with the data and their 
treatment, be stated as follows: 

1. There is practically complete certainty,
more than 99.9 chances¹ in 100, that the true
difference between the mean number of words
learned per pupil by third, fourth, fifth, and
sixth grade pupils under the pupil-self-study
method and under the modern-systematic
method is greater than zero. The difference is
in favor of the pupil-self-study method.

2. The mean rate of learning per pupil
twenty-minute period of school time devoted
to spelling is greater when third, fourth, fifth,
and sixth grade pupils study by the pupil-self-study
method than it is when they study
by the modern-systematic method.


¹ Henry E. Garrett, Statistics in Psychology and Education,
p. 134. New York: Longmans, Green and Company, 1926.

3. The mean per cent of learning retained 
by third, fourth, fifth, and sixth grade pupils 
is about the same when the pupils study by 
the pupil-self-study method as it is when they 
study by the modern-systematic method. 





[Vol. 6, No. 1 


4. In a very large majority of cases, third, 
fourth, fifth, and sixth grade pupils make 
worthwhile use of the time they save through 
exemptions. The large amount of time saved, 
together with the use pupils make of their 
exemption periods, is conclusive evidence that 
the pupil-self-study method has great possi- 
bilities as a way or a means of providing for 
individual differences and of varying and en- 
riching the curriculum. 
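
The "chances in 100" statement in conclusion 1 is read here as a critical-ratio argument: the obtained difference is divided by its standard error and referred to the normal curve. That reading is an assumption about the method, and the standard error used below is hypothetical, since the report gives the differences but not their standard errors.

```python
from statistics import NormalDist


def chances_in_100(difference, standard_error):
    """Chances in 100 that the true difference exceeds zero, under a normal model."""
    critical_ratio = difference / standard_error
    return 100 * NormalDist().cdf(critical_ratio)


# A critical ratio of 3 corresponds to about 99.9 chances in 100.
print(round(chances_in_100(3.0, 1.0), 1))

# Third grade difference from the findings, with an assumed standard error.
third_grade_difference = 6.73
assumed_standard_error = 1.0  # hypothetical value, for illustration only
print(round(chances_in_100(third_grade_difference, assumed_standard_error), 1))
```

A difference of zero would give 50 chances in 100, that is, no evidence in favor of either method.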


THE RELATIVE EFFICIENCY OF THE “SEPARATE” AND
“TOGETHER” METHODS OF TEACHING HOMONYMS¹


KEITH C. HARDER
Chattanooga, Tennessee 


The spelling of homonyms has ever been a 
source of difficulty for both teachers and 
pupils. An examination of thirty-eight spell- 
ing books published since 1826 revealed many 
ingenious arrangements for grouping, defin- 
ing, and illustrating homonyms in lessons. 

The problem of confusions in spelling was 
noticed by Burnham² as early as 1906 when
he wrote: 

“In regard to certain words many persons 
are the victims of chronic interference of as- 
sociation. Examples are the noun advice and 
the verb advise; ‘principal’, an individual, 
and ‘principle’, a rule of conduct — 

“In some respects prevalent methods might 
be revised with advantage. The usual method 
of teaching spelling of homonyms, for exam- 
ple, is to present them simultaneously. The 
result is apt to be interference of association. 
To avoid this they should be taught sep- 
arately. The very similarities that are sup- 
posed to aid the pupil become the source of 
confusion. In such cases the slight differ- 
ence, not the general similarity, is the im- 
portant thing.” 

In 1911 Pearson³ made the first experimental
attack on this problem when he studied
the teaching of homonyms at the Horace 
Mann School in New York. Each of the 
third, fourth, fifth, sixth, and seventh grades 
used had two sections with a teacher for each. 
The general plan of the experiment was: (1) 
to test the ability of each grade to spell the 
selected homonyms, (2) to have these homo- 
nyms taught by the regular teacher of one 
section of a grade by the “separate” method 
and by the other teacher of that grade by the 
“together” method, (3) to test the pupils’ 
ability to spell these homonyms, and (4) to 
compare the results of the two methods. In
one section of a grade the members of each pair
of homonyms were presented adjacent to each
other in the same lesson, and in the other
section with an interval of three or four days
between the words of each pair.


¹ Abstract of a dissertation submitted in partial fulfillment
of the requirements for the degree of Doctor of Philosophy,
in the Department of Education, in the Graduate College of
the State University of Iowa, August, 1937.

² Wm. H. Burnham, “The Hygiene and Psychology of
Spelling,” Pedagogical Seminary, XIII (December, 1906), 495.

³ Henry C. Pearson, “The Scientific Study of the Teaching
of Spelling,” Journal of Educational Psychology, II (May, 1911).

The results showed: (1) a gain for the 
“together” method in every grade except the 
fifth, (2) an advantage for the “together” 
method for the poor spellers. The experi- 
ment was repeated at Montclair, New Jersey, 
with the results confirming those found by 
Pearson. 

Several weaknesses may be observed in the 
study as carried out: 

1. No attempt was made to control the fac- 
tor of individual teacher personality. 

2. The interval between the appearance of 
the members of a pair (three or four days) 
when taught by the “separate” method was
too short. 

3. The sections used were not equated on 
spelling ability. 

4. A longer interval than ten days after 
study should elapse before testing for perma- 
nent recall. 

5. The number of cases used was too small
to be a representative sample. 


Several of the criticisms noted above were 
observed by Finkenbinder⁴ in his review of
this study. In 1923 Finkenbinder made the 
second experimental attack on this problem 
using three fourth grade classes in the public 
schools of Worcester, Massachusetts. Sixty 
homonyms were selected, twenty-four pairs of 
which were from the spelling books in use, 
and twelve homonyms, not pairs, from outside 
the spelling books. These homonyms and 
eighty-four other words were grouped into 
lessons of eighteen words each, and taught 
during a period of eight days, ten minutes of 
study per day. In order to equalize the ef- 
fect of the personality of each teacher on the 
two methods and to insure that each pupil 
studied both lists of words, a rotation procedure
was employed. It is briefly described
as follows:

The first twelve pairs of the forty-eight
homonyms were studied by the “separate”
method in one class and by the “together”
method in another class. The second twelve
pairs were studied by the “together” method
in the first class named above, and by the
“separate” method in the other class. In a
third class one half of the first twelve pairs
and one half of the second twelve pairs were
studied by the “separate” method and the
remaining one half of the first and second
twelve pairs were studied by the “together”
method. The pupils were tested at the close
of the study and, in two of the three classes,
one month later.


⁴ E. O. Finkenbinder, “The Spelling of Homonyms: An Experimental
Investigation of Teaching Them,” Pedagogical
Seminary, XXX (September, 1923), 241-251.


Results: 

1. The preliminary test showed that in one 
fourth of the pairs neither word was correctly 
spelled; and in about one half of the pairs, 
only one word was misspelled. 

2. In the test given immediately after 
study, the methods were found to be of almost 
equal value. On this test the middle fifty 
per cent of the pupils, ranked according to 
ability, showed a superiority for the “sep- 
arate” method. The poorest spellers showed 
a superiority for the “together” method. 

3. The results for the test one month after 
study show: 

a. For the group of homonyms, both of
which required study, the pupils taught by 
the “separate” method excelled in spelling ac- 
curacy those taught by the “together” method 
by 17.7 per cent. 

b. For the group of words, only one of 
which required study, the “separate” method 
excelled the “together” method by .5 per cent. 


Criticism of the study: 

1. Not enough time (eight days) was given 
to the study as a whole. 

2. In the “separate” method the interval 
between the studying of one homonym and 
the other member of a pair was too short. 

3. The number of pupils (three classes) 
was too small to give reliable results. 

4. In the analysis of the results Finkenbinder’s
conclusions are limited in their scope
by the small numbers of pupils and homonyms.
His main conclusions are based on six
pairs of homonyms (one fourth of the pairs
studied had both words misspelled on the pretest).
It was only in this group that differences
large enough to support recommendations
as to method were found. Twelve pairs
were in the group which had one word spelled
correctly on the pretest. This group showed
an advantage of only .5 per cent for the “separate”
method.


Foran⁵ summarizes the present status of
experimental evidence on the teaching of hom- 
onyms as follows: 


“Homonyms constitute a class of words 
that present considerable difficulty, and the 
best method of teaching them cannot be de- 
fined on the basis of existing evidence. The 
‘together’ and ‘separate’ methods are both 
used, and each may be defended on some 
basis or other. The early studies by Pearson 
support the teaching of homonyms ‘together’. 
Later, Finkenbinder found that the ‘separate’ 
method produced better results. Neither 
series of experiments was entirely free from 
defects that would tend to invalidate the re- 
sults regardless of whatever method will prove 
to be superior when the issue is eventually 
decided. Homonyms form one of the most 
difficult groups of words that pupils are called 
upon to spell. A recently published speller 
contains sixty-one pairs of such words, and 
practically all of them are commonly used. 
Such words as there and their, eight and ate 
occur in the early grades and are the source 
of much confusion.” 


OBJECT 


The purpose of this investigation is to determine
whether the “separate” or “together”
method of presenting homonyms in lessons 
would result in greater spelling accuracy. In 
addition, the results show which method is 
better for each of the homonyms included in 
the investigation. The methods are also com- 
pared to see if either method gives better re- 
sults for the highest and lowest ten per cent 
of the pupils. 


DEFINITIONS 


Since the following terms, used in this 
study, have specific meanings they are defined 
for the reader. 

Cluster, refers to a group of words having
the same phonetic sound but differing in spelling,
such as pair-pear-pare, there-their-they’re,
bare-bear, etc.


⁵ Thomas George Foran, The Psychology and Teaching of
Spelling, pp. 40-41. Washington, D. C.: The Catholic Education
Press, 1934.

Confusion, as used in this study, means the
substitution of the correct spelling of one 
homonym for another within the same cluster. 

Delayed recall interval, as used in this 
study, refers to the period of time, eight 
weeks, which elapsed between the immediate 
recall and the delayed recall tests. 

Errors, include both the wrong spelling of 
the homonym called for, and the right spell- 
ing of any other word substituted for it. 

Homonym, as defined by Webster,⁶ is “a
word having the same pronunciation as an- 
other, but differing from it in origin, meaning, 
and often, in spelling, as bare and bear . . .” 

Misspelled form, as employed in this study, 
means one of the ways in which an homonym 
is spelled incorrectly. 

Separate means that the words having the
same or similar sounds are presented singly,
only one of a cluster appearing in a lesson.


Right spelling refers only to the correct
spelling of the homonym called for. 


Together means that the two or three
words having the same or similar sounds are 
presented adjacent to each other in the same 
lesson. 


PROCEDURE 


Thirty third grade and twenty-four sixth 
grade classes from seven Iowa and Nebraska 
school systems using the Horn—Ashbaugh
Fundamentals of Spelling⁷ were used in the
experiment. This text gives specific directions
for a test-study method of teaching
spelling.⁸ The unit of instruction is the list
of words assigned for one week’s study. 
Briefly described, this method provides for a 
test over the words before study on Monday; 
a study of the words missed on this test on 
Tuesday; a second test over the words as- 
signed for the week and a review test over the 
words taught one month previously, on Wed- 
nesday; a further study on Thursday of the 
words missed on the Wednesday test; and on 
Friday a test over all the words tested on 
Wednesday. The reader should note that 
this method provides for testing both imme- 
diately after study and review testing one 
month later for all the words taught. 


6 Noah Webster, Webster's New International Dictionary of
the English Language, p. 1194. (2nd ed., unabridged).
Springfield, Mass.: G. and C. Merriam Company, 1934.

7 Ernest Horn and E. J. Ashbaugh, The Horn—Ashbaugh
Fundamentals of Spelling. Chicago: J. B. Lippincott and
Company, 1928.

8 Ibid., pp. xiii-xvi.
It was believed that the schools using this
text would show less variability in teaching 
procedure than schools using a variety of 
spelling methods and textbooks. The pre- 
test, instruction periods, immediate recall 
tests, and delayed recall tests covered the sec- 
ond semester of the school year 1934-1935. 

The “separate” and “together’’ groups in 
the third grade and in the sixth grade were 
equated by means and standard deviations 
based on the pretest as follows: 


TABLE I

EQUATION OF EXPERIMENTAL GROUPS

                                    No. of   Arithmetic   Standard
Test                    Method      Cases    Mean         Deviation

Third Grade
Immediate Recall Test   Separate    588      21.112       11.6473
                        Together    588      21.112       11.6473
Delayed Recall Test     Separate    297      22.0370      12.0560
                        Together    295      22.0373      12.0615

Sixth Grade
Immediate Recall Test   Separate    636      37.5582      14.0856
                        Together    635      37.5582      14.0856
Delayed Recall Test     Separate    279      36.8638      13.4624
                        Together    279      36.8638      13.4592


A rotation technique provided that each 
teacher taught both methods and both lists 
of words. 
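
The rotation technique can be sketched in a few lines of code. The block below is only an illustration, assuming the four arrangements laid out in Table II; the names `METHODS`, `LISTS`, and `rotation_schedules` are hypothetical, and the order in which the arrangements are generated is not the teacher order of the table.

```python
from itertools import product

# Hypothetical labels for the two methods and the two word lists.
METHODS = ("separate", "together")
LISTS = ("A", "B")


def rotation_schedules():
    """Build the four class arrangements of a two-period rotation.

    Each arrangement pairs a first method and a first list, then switches
    to the other method and the other list for the second period.
    """
    schedules = []
    for first_method, first_list in product(METHODS, LISTS):
        second_method = METHODS[1 - METHODS.index(first_method)]
        second_list = LISTS[1 - LISTS.index(first_list)]
        schedules.append([
            "pretest",
            (first_method, first_list, 8),    # eight weeks of instruction
            "immediate recall",
            (second_method, second_list, 8),  # eight weeks of instruction
            "final test",
        ])
    return schedules


schedules = rotation_schedules()
for schedule in schedules:
    print(schedule)
```

Under this arrangement every class meets both methods and both lists, so that differences between teachers and between lists are balanced across the two methods.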


SELECTION OF WORDS


The lessons were composed of homonyms 
and other words. The words other than 
homonyms were selected from the second 
semester lessons in the third and sixth grades 
of the Horn—Ashbaugh Fundamentals of
Spelling.⁹ Two hundred and nineteen homonyms
were selected from Horn and Ashbaugh,¹⁰
Ashbaugh,¹¹ Fitzgerald,¹² Cooper,¹³
Masters,¹⁴ Peterson,¹⁵ and Fallows.¹⁶ From
these homonyms the sixty used in the third
grade and the eighty-six used in the sixth


9 Ibid.

10 Ibid.

11 E. J. Ashbaugh, The Iowa Spelling Scales.
Illinois: Public School Publishing Co., 1922.
Educational Research Monographs, No. 3.

12 James A. Fitzgerald, “Words Misspelled Most Frequently
by Children of the Fourth, Fifth and Sixth Grade Levels in
Life Outside the School,” Journal of Educational Research,
XXVI (November, 1932), 213-218.

13 E. J. Cooper, “The 1035 Words Most Commonly Misspelled
by Minnesota Pupils in Grades Three to Eight.”
Master's Thesis, University of Minnesota, 1927.

14 H. V. Masters, A Study of Spelling Errors. University
of Iowa Studies in Education, IV, No. 4. Iowa City, Iowa,
1927.

15 Evelyn F. Peterson, [title illegible in source].
Master’s Thesis, University of Iowa, 1930.

16 Samuel Fallows, A Complete Dictionary of Synonyms and
Antonyms. Chicago: F. H. Revell Company, 1883.
TABLE II

GENERAL ARRANGEMENT OF CLASSES FOR THIRD AND SIXTH GRADES ILLUSTRATING THE
ROTATION TECHNIQUE

Teacher   Test      Method      Test        Method      Test

A         Pretest   Separate    Immediate   Together    Final
                    List A      Recall      List B
                    (8 weeks)               (8 weeks)

B         Pretest   Together    Immediate   Separate    Final
                    List A      Recall      List B
                    (8 weeks)               (8 weeks)

C         Pretest   Together    Immediate   Separate    Final
                    List B      Recall      List A
                    (8 weeks)               (8 weeks)

D         Pretest   Separate    Immediate   Together    Final
                    List B      Recall      List A
                    (8 weeks)               (8 weeks)


Explanatory notes for Table II 


a. Every teacher taught the “separate” method for eight weeks and the “together” method 
for eight weeks. 

b. In each eight-week period the homonyms were introduced during the first four weeks and 
reviewed during the next four weeks. 

c. In the third grade fifteen words were taught per week; in the sixth grade twenty words 
were taught per week. 

d. The homonyms in each grade were divided into lists A and B and were both taught in 
each eight-week period. 

e. The immediate recall test for each grade was composed of only the homonyms which had
been studied during the first eight weeks.

f. In third grade 240 words were taught, of which sixty were homonyms. In the sixth 
grade 320 words were taught, of which eighty-six were homonyms. 





grade were chosen on the basis of the data
available at the University of Iowa on frequency
of adult usage, frequency of child
usage, grade [words illegible in source]. The
homonyms used in each grade were divided
into two lists of approximately equal difficulty.

THIRD GRADE—LIST A

aisle-I’ll-isle    ant-aunt        billed-build    cents-sense
fore-four          hears-here’s    higher-hire     hours-ours
lets-let’s         mail-male       road-rode       some-sum
to-too-two         wait-weight

THIRD GRADE—LIST B

ate-eight          close-clothes   grate-great     heard-herd
its-it’s           knew-new        knights-nights  knows-nose
meat-meet-mete     pair-pare-pear  right-write     their-there
wares-wears        weak-week

SIXTH GRADE—LIST A

accepted-excepted        adieu-ado                affect-effect    aisle-I’ll-isle
assistance-assistants    baring-bearing           billed-build     choir-quire
commence-comments        council-counsel-consul   false-faults     pause-paws
presence-presents        profits-prophets         rooted-routed    soles-souls
to-too-two               waits-weights            wares-wears      weave-we’ve

SIXTH GRADE—LIST B

addition-edition         attendance-attendants    canvas-canvass        coarse-course
companies-company’s      currants-currents        decent-descent        fate-fete
fathers-father’s         intents-intense          ladies-ladies’-lady’s passed-past
plain-plane              praise-prays-preys       principal-principle   sane-seine
stationary-stationery    their-there-they’re      vary-very             writers-writer’s













September, 1937| 


ARRANGEMENT IN LESSONS 


Each cluster of homonyms was placed in 
two booklets. The ‘separate’ method book- 
let of eight lessons was so arranged that the 
first four lessons included homonyms grouped 
so that an interval of approximately two 
weeks elapsed between the studying of the 
homonyms of a cluster. The “together” 
method booklet of eight lessons was so ar- 
ranged that the first four lessons included 
homonyms grouped together. In both book- 
lets the last four lessons had no homonyms. 
The first four lessons appeared as review 
words during the second four weeks. The 
meaning and identification of each homonym 
were made clear by its use in a sentence. 
The words not homonyms were listed in col- 
umn form. 

Lesson two follows, as given in the booklets 
for the third grade, “together” method. 


LESSON TWO

lace       guess      change     file
match      steam      engine     potatoes
afraid     named      hundred

herd       Mr. Jones has a fine dairy herd.    herd
heard      I heard the birds sing.             heard
ate        He ate an apple.                    ate
eight      I saw eight birds.                  eight

Lesson two follows, as given in the booklets
for the sixth grade, “separate” method.

LESSON TWO

slight     sweetheart   cheaper    carbon
occurs     finest       poultry    amounting
advised    moral        assure     educational

isle         The isle was captured.                    isle
bearing      He was bearing a plank.                   bearing
assistance   She was of assistance to her mother.      assistance
adieu        His adieu was very formal.                adieu
wears        She wears a blue dress.                   wears
accepted     He accepted the offer.                    accepted
choir        The choir sings in church.                choir
council      The town council met last night.          council


INSTRUCTIONS TO TEACHERS 


Before being sent to the schools, detailed
instructions were prepared and given a trial 
by teachers experienced in using the test-study 
method. The instructions included a general 
description of the experiment, its purpose, and 
procedure in so far as it affected the teacher 
in the classroom. Attention was called to the 
use of the method of teaching, and the 
method of study to be followed by the pupils 
as outlined by Horn and Ashbaugh.¹⁶ᵃ Detailed
directions for the giving of the pretest,
immediate recall, and delayed recall tests 
were included. In addition, each teacher re- 
ceived special instructions with the booklet 
for the method to be taught. 

In teaching the “separate’’ method the 
teacher was instructed not to call attention to 
other words which have the same sound. If 
the pupils asked questions about other hom- 
onyms, they were to be answered but not in 
detail. The teacher was to keep in mind 
that the homonyms were not to be taught together
in this part of the experiment.

In teaching the “together” method the 
teacher was instructed to stress the fact that 
here were two or three words with the same 
sound but different meaning. These two or
three words were to be associated together 
during the learning period. 

TESTS 

The form of the test selected was the modi- 
fied sentence containing blanks in which the 
pupil wrote the proper word. Since there 
was a possibility that pupils might guess the 
right form of the homonym to use in the sen- 
tence, it was decided to test each item twice, 
thus getting a pupil response to two situations 
in which the same meaning of the word was 
used. In making the tests and booklets, a 
certain meaning was selected as the one to use 
for the meaning of the homonym. The first 
sentence used in the booklets and also in the 
tests was called Form I. The second was
called Form II, the third, Form III. The pre- 
test was made up of Forms I and II for each 
list of homonyms and the immediate recall 
tests of Forms II and III. The delayed recall 
used the same forms as the pretest. Some 
illustrative sentences from the teachers’ copy 
of the third grade pretest, Forms I and II, 
are given showing the testing of the homonym 
clusters ant-aunt, and their-there. The teacher 
was instructed to pronounce the word, read 
the sentence using it, and pronounce the word 
called for again. On the pupils’ test sheets 
the homonyms to be spelled were omitted. 


           Item number    Sentence Used

Form I
ant        5.             The ________ ran on the table.
there      10.            I want to go ________.
aunt       17.            Did you see your ________?
their      [illegible]    ________ house is red.

Form II
aunt       [illegible]    [sentence illegible in source]
there      [illegible]    ________ is the flag.
ant        17.            Did you see the ________ hill?
their      26.            I want ________ books.


16a Horn and Ashbaugh, op. cit., pp. xiii-xvi, xxiv-xxvi.

Immediately preceding the study of the 
booklets, each class participating was given 
the pretest which included all of the homo- 
nyms that were studied in the investigation. 
The immediate recall test was given at the 
end of the eighth week and each pupil was 
tested on all of the homonyms he had studied 
during the preceding eight weeks. 

At the close of the experiment the pretest 
was repeated. For the homonyms studied 
during the latter half of the experiment it 
was an immediate recall test and for those 
studied during the first half of the experiment 
it was a delayed recall test. 

In scoring the weekly tests the teachers 
were instructed to mark a word wrong if it 
was omitted, illegibly written, if the wrong 
word or wrong homonym was substituted, if 
apostrophes were used incorrectly, if the word 
had been erased to correct, marked over to 
correct, been begun incorrectly, crossed out 
and rewritten, or did not have i’s dotted or
t’s crossed.


METHOD OF ANALYZING THE RESULTS 


In the major analysis of the results the 
“separate” and “together” methods were 
compared on the basis of the number of right 
spellings. The statistical treatment used in 
reporting the main findings in this study was 
developed by Lindquist¹⁷ and is called the
formula for the standard error of the differ- 
ence for matched groups. 


$$\sigma_{\text{diff.}} = \sqrt{\left(\frac{\sigma_1^{2}}{N_1} + \frac{\sigma_2^{2}}{N_2}\right)\left(1 - r^{2}\right)}$$

$\sigma_1$ and $\sigma_2$ are the standard deviations of the
scores of the matched groups. $N_1$ and $N_2$
are the number of cases in each group. “r”
is the correlation between the pretest and the
test being considered.

$$\text{critical ratio} = \frac{\text{Difference in A. M.}}{\text{Standard error of a difference}}$$


The critical ratio was determined for each 
difference between the groups. If it was more 
than three, the difference was considered to
be significant. 
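
As a rough check on the arithmetic, the standard error and the critical ratio can be computed directly. The sketch below is illustrative only: the function names are hypothetical, and applying each group's own correlation to its own term is an assumption made here because Tables III and IV report one “r” for each group. With a single “r” the function reduces to the matched-groups formula. The figures entered are the sixth grade immediate recall values from Table III.

```python
from math import sqrt


def se_diff_matched(sd1, n1, sd2, n2, r1, r2=None):
    """Standard error of the difference between means of matched groups.

    With one correlation (r2 omitted) this is
    sqrt((sd1^2/n1 + sd2^2/n2) * (1 - r^2)).
    Passing a second correlation applies each r to its own group's term,
    which is an assumption of this sketch, not a statement of the source.
    """
    if r2 is None:
        r2 = r1
    return sqrt(sd1 ** 2 * (1 - r1 ** 2) / n1 + sd2 ** 2 * (1 - r2 ** 2) / n2)


def critical_ratio(mean1, mean2, se):
    """Difference between the arithmetic means divided by its standard error."""
    return abs(mean1 - mean2) / se


# Sixth grade, immediate recall (Table III): separate vs. together.
se = se_diff_matched(12.3464, 636, 10.9656, 636, 0.7011, 0.6378)
cr = critical_ratio(68.4403, 73.0740, se)
print(round(se, 3), round(cr, 2))
```

The result lies close to the printed standard error of .484 and critical ratio of 9.57; small differences in the last decimal place come from rounding in the printed inputs.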


17 E. F. Lindquist, “The Significance of a Difference Between
Matched Groups,” Journal of Educational Psychology,
XXII (March, 1931), 197-204.
In determining which method was better
for each homonym the comparisons were
based on the number of pupils having two-right
spellings.¹⁸ A formula by Yule,¹⁹ with
the percentage tables developed by Edgerton
and Paterson,²⁰ was used in determining
whether the percentage differences in right 
spellings between the two methods for each 
homonym were significant. The formula is: 

$$\sigma_{\text{diff. in percentage}} = \sqrt{\sigma_{p_1}^{2} + \sigma_{p_2}^{2}}$$

This formula is based on another by Yule
called the “standard error of a percentage or
proportion”.

$$\sigma_p = \sqrt{\frac{pq}{N}}$$

p = The percentage or proportion
q = The difference between p and 1.00
N = The number of cases
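
The two formulas combine in a short computation. The following is a minimal sketch with hypothetical function names; the proportions and group sizes are invented solely to show the arithmetic and are not data from this study, which relied on the Edgerton and Paterson tables rather than direct computation.

```python
from math import sqrt


def se_proportion(p, n):
    """Standard error of a proportion: sqrt(p * q / N), with q = 1 - p."""
    q = 1.0 - p
    return sqrt(p * q / n)


def se_diff_proportions(p1, n1, p2, n2):
    """Standard error of the difference between two independent proportions."""
    return sqrt(se_proportion(p1, n1) ** 2 + se_proportion(p2, n2) ** 2)


# Hypothetical illustration: 60 per cent of 150 pupils two-right by one
# method, 47 per cent of 150 pupils two-right by the other.
p_first, n_first = 0.60, 150
p_second, n_second = 0.47, 150

se = se_diff_proportions(p_first, n_first, p_second, n_second)
cr = (p_first - p_second) / se
print(round(se, 4), round(cr, 2))
```

A thirteen per cent difference with groups of this assumed size gives a critical ratio a little above two, which would fall short of the criterion of three used in this study.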


MOST FREQUENT ERRORS


In addition to the above, a study of the er- 
rors was made for each of the methods of 
presentation in third and sixth grades. A 
sampling of papers was taken from the im- 
mediate and delayed recall tests, 150 from 
each test for each method, by selecting every 
third paper from all schools participating, un- 
til the desired number was obtained. The 
errors found on each of these test papers were 
tabulated and classified for each method of 
presentation, according to the number of mis- 
spelled forms, the most common errors with 
frequency of occurrence, and the total num- 
ber of errors for each homonym taught. 

RESULTS 

The main results reported in the following 
tables show that one method is superior for 
immediate recall and the other method has 
an advantage on the delayed recall tests. The 
results also show that certain homonyms give 
better results when taught by one method and 
certain others when taught by the other
method.


THE “SEPARATE” AND “TOGETHER” METHODS
COMPARED ON SPELLING ACCURACY


The data in Tables III and IV are based
on the number of right spellings found on the

18 Two-right spellings are those in which the pupil spelled
the homonyms right on both chances.

19 Yule, [title illegible in source], p. 269. Chicago: J. B. Lippincott and Company, 1924.

20 Harold A. Edgerton and Donald G. Paterson, “Table of
Standard Errors and Probable Errors of Percentages for Varying
Numbers of Cases,” Journal of Applied Psychology, X
(1926), 378-391.

21 Yule, op. cit., p. 257.










immediate recall tests. Each pupil in the 
third grade studied thirty homonyms by the 
“separate” method and thirty by the “together”
method. In the sixth grade each pupil
studied forty-three homonyms by each
method. The tables show results based on 
thirty homonyms in the third grade and on 
forty-three in the sixth grade, which are the 
number each pupil studied by each method. 
Each of the homonyms was tested twice in 
sentences, a procedure which gave the third 
grade pupils sixty chances to spell their thirty 
homonyms, and the sixth grade pupils eighty- 
six chances to spell their forty-three. 

The main results of the study are based on 
the sum of the one-right and two-right spell- 
ings. The two-right spellings are those in 
which the pupil spelled the homonyms right
on both chances. The one-right spellings are 
those in which the pupil was right only once. 
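
The tally for a single pupil can be illustrated as follows. This is a sketch under an assumption: the total of right spellings is read here as the count of correct responses over all spelling opportunities, which agrees with the means being reported out of sixty and eighty-six opportunities. The function name and the sample responses are hypothetical.

```python
def tally_right_spellings(responses):
    """Tally two-right and one-right homonyms for one pupil.

    `responses` holds one (first_chance, second_chance) pair of booleans
    per homonym, True meaning the homonym was spelled right.
    """
    two_right = sum(1 for first, second in responses if first and second)
    one_right = sum(1 for first, second in responses if first != second)
    # Assumed reading: each two-right homonym contributes two right
    # spellings, each one-right homonym contributes one.
    total_right = 2 * two_right + one_right
    return two_right, one_right, total_right


# Hypothetical pupil tested on four homonyms (eight spelling opportunities).
sample = [(True, True), (True, False), (False, False), (False, True)]
two_right, one_right, total_right = tally_right_spellings(sample)
print(two_right, one_right, total_right)
```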


A graphical representation of the results is 
given in Figure 1 which shows the average 
number of right spellings, errors, and confu- 
sions found in the third and sixth grades on 
the immediate and delayed recall tests. 


Table III should be read as follows: In 
the third grade on the immediate recall test 
the 588 pupils studying by the “separate” 
method average 44.3214 right spellings out 
of sixty spelling opportunities; the 588 pupils 
studying by the “together” method average
45.8384 right spellings out of sixty spelling
opportunities, etc. 


TABLE III

THE SPELLING ACCURACY RESULTS ON THE IMMEDIATE RECALL TESTS

                                   Third Grade              Sixth Grade
                                   Separate     Together    Separate   Together
No. of pupils                      588          588         636        636
No. of homonyms in test            30           30          43         43
No. of spelling opportunities      60           60          86         86
A. M.ᵃ of right spellings          44.3214      45.8384     68.4403    73.0740
S. D.ᵇ                             [illegible]  12.2809     12.3464    10.9656
rᶜ                                 .6644        .6497       .7011      .6378
Difference in A. M.                       1.517                   4.634
S. E. of differenceᵈ                      .537                    .484
Critical ratio                            2.82                    9.57

ᵃ Arithmetic mean.
ᵇ Standard deviation.
ᶜ “r” is the correlation between the pretest and the immediate recall test.
ᵈ Standard error of the difference for matched groups.


The results of Table III, based on the im- 
mediate recall tests, show a statistically sig- 
nificant difference (critical ratio 9.57) in the 
sixth grade, and a difference approaching sig- 
nificance (critical ratio 2.82) in the third 
grade for the “together” method. These results
indicate that for immediate recall the
“together” method is much better than the
“separate” method.

The results for the pupils who took the de- 
layed recall tests are given in Table IV. 


TABLE IV

THE SPELLING ACCURACY RESULTS ON THE DELAYED RECALL TESTS

                                   Third Grade              Sixth Grade
                                   Separate     Together    Separate   Together
No. of pupils                      297          295         279        279
No. of homonyms in test            30           30          43         43
No. of spelling opportunities      60           60          86         86
A. M.ᵃ of right spellings          41.2088      41.0407     66.7419    64.8853
S. D.ᵇ                             13.9465      13.7147     12.7817    13.7296
rᶜ                                 .7449        .7381       .7195      .7546
Difference in A. M.                       .168                    1.857
S. E. of differenceᵈ                      .763                    .799
Critical ratio                            .22                     2.32

ᵃ Arithmetic mean.
ᵇ Standard deviation.
ᶜ “r” is the correlation between the pretest and the delayed recall test.
ᵈ Standard error of the difference for matched groups.
Table IV, which reveals the results on the
delayed recall tests, shows a reverse trend
from that in the preceding table, and in both
grades gives an advantage to the “separate”
method. A consistent, but not statistically
significant, difference is found for the “separate”
method in the third grade (critical ratio
.22), and in the sixth grade (critical ratio
2.32) on the delayed recall test.


Since the final measure of spelling accuracy
is permanent recall, greater weight is allowed 
the evidence on delayed recall than that on 
immediate recall. Considering the consist- 
ency of the findings on the delayed recall 
tests, a reasonable interpretation is to favor 
the “separate” method as giving slightly bet- 
ter results than the “together’’ method. 


TABLE V

THIRD GRADE HOMONYM CLUSTERS GROUPED ACCORDING TO THE METHOD GIVING THE HIGHER
SPELLING ACCURACY ON THE DELAYED RECALL TEST

[Table body not recoverable from the source. Column headings: Per cent
Advantage; Critical Ratio. Divisions: Group A—Separate Method; Group
A—Together Method; Group B—Separate Method; Group B—Together Method.
Legible words: close, clothes, build, cents, hire, higher. Legible critical
ratios: 4.29, 2.21, 2.49, 2.16, 3.49, 3.03, 3.65, 3.99, 2.99, 4.04, 2.60.]


CONFUSION


The confusions which occur in spelling 
homonyms are a source of many errors. On 
the immediate recall test, the “separate”
method has so many more confusions than the
“together” method that the difference between
them is statistically significant. On
the delayed recall tests, the differences between
the methods, although not statistically
significant, show that the “separate” method
has more confusions in the third grade and
the “together” method has more in the sixth
grade.


JOURNAL OF EXPERIMENTAL EDUCATION 


[Vol. 6, No. 1 


THE METHOD GIVING BETTER RESULTS
FOR EACH HOMONYM


The method which shows the better results 
for each homonym was determined as follows: 
The number of pupils who spelled each hom- 
onym correctly on the delayed recall test, on 
both of the opportunities to spell it, was de- 
termined for each method. This number was 
then converted into a percentage by dividing 
the number right by the number of pupils 
spelling the homonym by that method. The 
difference in percentage between the methods 
was then found for each homonym and com- 


TABLE VI 


SIxTH GRADE HOMONYM CLUSTERS GROUPED ACCORDING TO THE METHOD GIVING THE HIGHER 
SPELLING ACCURACY ON THE DELAYED RECALL TESTS 


Per cent Critical Per cent Critical 
Advantage Ratio Advantage Ratio 
Group A—Separate Method 
OE csi cenraintinen 7 empenies ............ 1§ 2.56 
excepted —_- setsintaieainin a 3.67 ER 12 2.04 
SD ccccacacctmamaue 13 2.16 i cecil tn cnlaehae ees 5 
EA ee one. 2 RT See ee 11 7 
a 11 2.12 TE IE 13 2.24 
EE | sncnnatinintiines 5 GE wien nmnanss 26 4.59 
a i alin 12 2.17 
| eee eee 1 
Group A—Together Method 
A ee 8 | IE or 3 
effect ._..-- SS 2.50 _ SE eee 6 
| a 1l 2.14 
OT: RS: 14 2.33 — 
Spee 6 
ERS eer 7 (separate) 
Group B—Separate Method 
I xinncmneanccie 4 I 56 nice is ainiehyiseevein 1 (together) 
GE waneddwancinns 5 RNIN. 5::ci-ccintascneenaiinienintanainneiah 6 
s DE eicntidbenedmbaas 2 
0 ee 5 
IEE, \isseunthiccencens 0 I eared ec 3 
I co adiachinn cctnchekiniieiah 6 
attendance __..____--- 1 (together) — 
SIN, ccecateinteeanecem SEE ditdansibaaebnin 3 
s SET sidosvccleategdbtoninttiatind: 4 
Earn tera ae ae 1 
ners 5 ID cnc. cone oeenengaeen 7 
GE hctncionnnainoinn 5 
I icicata tate inte mitaniiiaiins 8 
DED Anitctimsiodimaneon 5 es ee 0 
SID -siiciccossdakbes dogiaicanatbacunand 6 
CIEEEE iccncnswcnnun 6 
a ee 3 2 
: CE Sdcudinenwensene 5 
SI. 4: ncsinttimnendieiinsitnmeanidl 8 : 
a eee 7 wats ...............- 
ee ane 4 IE: seit inrrinwictnnns 1 (together) 
O_o 9 WEED anncacacssacsenn 6 
aa 5 =e ee. 5 
St acasustsebdnsaca “O . 


plane 
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Group B—Together Method 


Per cent Critical Per cent Critical 
Advantage Ratio Advantage Ratio 
Ce 13 2.16 
en 6 (separate) 
Group C 
Per cent advantage Per cent advantage 
Separate Together Separate Together 
aisle qc<ccsmaesoscnn 3 NE  acisnccecibicaeaneinieiatn. a 
————E—EE— 6 | aes ee 1 
1G acoctensconmmmacee 4 P 
presence sides 3 
Pk eR 1 presents - 1 
SUES  cewesnacsun 0 yxjnes 
principal ........ ie 4 
De. cisscnciemin ce nnitanninig 1 ID. sekeenacemeae 2 
GOED cetiet ctnncns 0 a 
CED cecccdnmncnnane 0 UI babies as scbtidmeuniioidicanetetian 1 
er 4 WE sictouda~wald amiictiion 1 
IE iii ci tenticcdentbiciciniee 1 aR ease 4 
og 1 0 a re 2 
a ee 7 NG sc thas tee sidetmniind 1 
AGERE wrnetiennmenme 8 WEL. ct ceteaia cae 3 


pared, using the formula by Yule** to see if 
the difference between them is statistically sig- 
nificant. Critical ratios were determined for 
all the differences of thirteen per cent or more. 
The homonyms taught in each grade are 
grouped into three divisions on the strength 
of the evidence for the “separate” or “to- 
gether” method. Group A includes those 
homonym clusters in which one member of a 
cluster has eleven to twenty-six per cent ad- 
vantage for one method or the other; group B 
includes those homonym clusters for which 
there is five to nineteen per cent advantage 
for the cluster for one method or the other; 
and group C includes those homonym clusters 
for which there is less than five per cent ad- 
vantage for one method or the other. 
The results of the delayed recall tests given
in Table V show that 53.56 per cent of the
third grade homonyms have appreciably
higher accuracies, five to twenty-six per cent,
when taught by the “separate” method and
39.28 per cent when taught by the “together”
method.

The results of the delayed recall tests
given in Table VI show that sixty per cent of
the sixth grade homonyms have appreciably
higher accuracies, five to twenty-six per cent,
when taught by the “separate” method, and
ten per cent when taught by the “together”
method.

Gains in spelling accuracy during the delayed
recall interval are recorded for nineteen
homonyms in the third grade and thirty in
the sixth grade. Of these homonyms more
than three times as many were taught by the
“separate” method as by the “together”.

22 Yule, op. cit., p. 269.


THE METHODS COMPARED FOR THE BEST
AND POOREST PUPILS


The effectiveness of each method was de- 
termined for the highest and lowest ten per 
cent of the pupils. On the immediate recall 
tests the highest and lowest ten per cent of
the pupils in both grades have a consistent,
although not statistically significant, advantage
for the “together” method.

On the delayed recall tests the highest ten
per cent of the pupils in the third grade do
equally well by both methods and the lowest
ten per cent of the pupils do slightly better
by the “together” method. In the sixth grade
on the delayed recall tests the highest and
lowest ten per cent of the pupils have an
advantage for the “separate” method.


These findings are in agreement with the 
main findings of the investigation given in 
Tables III and IV. The only exceptions are 
found in the delayed recall tests in third grade 
which show that the highest ten per cent of 
the pupils do equally well by both methods 
and the lowest ten per cent do slightly better 
by the “together” method. 


CAUSES OF SPELLING ERRORS 


Confusions account for fifty per cent of the 
errors found in approximately one third of 
the homonyms taught in the third grade and
in one half of those taught in the sixth grade. 
Another homonym of the cluster is the most 
frequent form of error found for approxi- 
mately three fourths of the homonyms taught 
in third grade and in four fifths of those 
taught in sixth grade. 

A careful examination of the tabulation of 
the misspelled forms shows that many of the 
errors were phonetic, that carelessness in writ- 
ing, inversion, omission, and insertion of let- 
ters, misuse of the apostrophe, and omission 
of capital letters, as in other words, are com- 
mon causes of the errors in spelling hom- 
onyms. 


INCREASES AND DECREASES IN SPELLING 
ACCURACY AND CONFUSIONS DURING 
THE DELAYED RECALL INTERVAL 

One of the problems arising in this study 
was to account for the differences in loss in 
maintenance of spelling accuracy shown by 
the two methods for the period between the 
immediate and delayed recall tests. The evi- 
dence in both the third and sixth grades shows 
that the “together” method gives superior 
spelling results on the immediate recall tests. 
The delayed recall tests, given eight weeks 
later, show that the “separate” method has an 
advantage. Just what has happened to cause 
this reversal to take place? In Table VII 
are shown the decreases in spelling accuracy 
between the immediate and the delayed recall 
tests for the third and the sixth grades. 

Table VII shows that the pupils taught by
the “separate” method have a decrease in spelling
accuracy that is smaller by 2.1234 homonyms
per pupil in the third grade, and by 3.7814
homonyms per pupil in the sixth grade, than that
of the pupils taught by the “together” method.


TABLE VII

THE INCREASES AND DECREASES IN SPELLING ACCURACY DURING THE DELAYED RECALL INTERVAL

                                   Immediate Recall         Delayed Recall
                                   Separate    Together     Separate    Together
Third Grade
No. of pupils                      297         295          297         295
No. of homonyms in tests           30          30           30          30
No. of spelling opportunities      60          60           60          60
A. M. of right spellings           44.4040     46.3593      41.2088     41.0407
A. M. decrease                     Separate 3.1952          Together 5.3186
Diff. in decrease                  2.1234

Sixth Grade
No. of pupils                      279         279          279         279
No. of homonyms in tests           43          43           43          43
No. of spelling opportunities      86          86           86          86
A. M. of right spellings           68.7634     70.6882      66.7419     64.8853
A. M. decrease                     Separate 2.0215          Together 5.8029
Diff. in decrease                  3.7814


Table VIII presents the relation that was
found to exist between the increases and decreases
in spelling accuracy and confusions.
Were the increases in errors which developed
during the delayed recall interval caused by
confusions?


TABLE VIII

THE RELATION BETWEEN ERRORS AND CONFUSIONS ON THE IMMEDIATE AND DELAYED RECALL TESTS

                                   Immediate Recall         Delayed Recall
                                   Separate     Together    Separate    Together
Third Grade
No. of pupils                      297          295         297         295
No. of homonyms in tests           30           30          30          30
No. of spelling opportunities      60           60          60          60
A. M. errors                       15.596       13.6407     18.7912     18.9593
A. M. confusions                   [illegible]  5.4644      6.5926      5.8881
Confusions divided by errorsᵃ      [illegible]  40.02       35.07       31.65
A. M. increase in errors           Separate 3.1952          Together 5.3186
Diff. in increase                  2.1234
A. M. increase in confusions       Separate (decrease) .76      Together .43
Increase in confusions divided
  by increase in errorsᵇ           Separate (decrease) 23.82    Together 8.08
Per cent advantage                 31.90

Sixth Grade
No. of pupils                      279          279         279         279
No. of homonyms in tests           43           43          43          43
No. of spelling opportunities      86           86          86          86
A. M. errors                       17.2366      15.3118     19.2581     21.1147
A. M. confusions                   9.4265       7.3548      10.2688     10.5699
Confusions divided by errorsᵃ      54.69        48.00       53.32       50.07
A. M. increase in errors           Separate 2.0215          Together 5.8029
Diff. in increase                  3.7814
A. M. increase in confusions       Separate .84             Together 3.22
Increase in confusions divided
  by increase in errorsᵇ           Separate 41.58           Together 55.10
Per cent advantage                 13.52

ᵃ Per cent of the errors which are confusions.
ᵇ Per cent of the increase in errors which are caused by increase in confusions.


A comparison of the increases in errors
during the delayed recall interval given in
Table VIII shows that the “together” method
has 2.1234 more errors per pupil in the third
grade and 3.7814 more in the sixth grade than
does the “separate” method. Are these increases
in errors caused by added confusions?
If so, how do the two methods compare?

In the third grade, as shown by Table VIII,
the “separate” method has a decrease of .76
confusions per pupil (23.82 per cent of the
increase in errors) during the delayed recall
interval, and the “together” method has an
increase of .43 per pupil (8.08 per cent of the
increase in errors). The net result of this
decrease in confusions in the “separate”
method and the increase in confusions in the
“together” method gives the “separate”
method an advantage of having 31.9 per cent
smaller increase in errors caused by confusions
during the delayed recall interval than
does the “together” method.

In the sixth grade, confusions cause 41.58
per cent of the increase in errors by the “separate”
method and 55.10 per cent of the increase
by the “together” method during the
delayed recall interval. These results make
it clear that confusions in the “together”
method during the delayed recall interval
cause 13.52 per cent more of the increase in
errors than they do for the “separate” method.


It may reasonably be assumed, then, that the superiority in maintenance
for the “separate” method is caused by the smaller increase in errors
which occurred during the delayed recall interval. This smaller increase
in errors is partly accounted for in the third grade by the
disappearance of confusions already present, and in the sixth grade by
fewer confusions developing during the delayed recall interval.
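
The arithmetic that links the two tables can be retraced with a short sketch. It assumes that the mean rows of Table VII are mean numbers of correct spellings and that mean errors equal spelling opportunities minus mean correct spellings; the function and variable names are illustrative and are not part of the study.

```python
# Sixth-grade figures as printed in Tables VII and VIII.
OPPORTUNITIES = 86  # spelling opportunities per pupil, sixth grade

mean_correct = {
    "separate": {"immediate": 68.7634, "delayed": 66.7419},
    "together": {"immediate": 70.6882, "delayed": 64.8853},
}


def mean_errors(method: str, test: str) -> float:
    """Mean errors, taken as opportunities minus mean correct (assumed relation)."""
    return round(OPPORTUNITIES - mean_correct[method][test], 4)


def increase_in_errors(method: str) -> float:
    """Growth in mean errors from the immediate to the delayed recall test."""
    return round(mean_errors(method, "delayed") - mean_errors(method, "immediate"), 4)


sep_increase = increase_in_errors("separate")
tog_increase = increase_in_errors("together")
diff_in_increase = round(tog_increase - sep_increase, 4)

# Per cent of the increase in errors attributed to confusions (Table VIII).
pct_from_confusions = {"separate": 41.58, "together": 55.10}
pct_advantage = round(
    pct_from_confusions["together"] - pct_from_confusions["separate"], 2
)

print(sep_increase, tog_increase, diff_in_increase, pct_advantage)
```

Under these assumptions the sketch reproduces the sixth-grade increases in errors, their difference, and the per cent advantage given in Table VIII.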


DISCUSSION OF FINDINGS

The evidence points clearly to two seemingly contradictory findings:

1. The “together” method is superior on immediate recall.

2. The “separate” method has an advantage on delayed recall.


The results offer an explanation of the findings of Pearson,²³ who
discovered in 1911 that the “together” method was better, and
Finkenbinder,²⁴ who found in 1923 that the “separate” method was better.
The published material on Pearson’s study shows that he gave his final
test ten days after the instruction period ended. This would hardly
measure all the loss that might occur if a longer delayed recall
interval were employed.

Finkenbinder gave an immediate recall test at the end of the instruction
period and a delayed recall test one month later.

The results of the studies by Pearson and Finkenbinder and those of this
investigation lend themselves to consistent inference. They indicate:
(1) that there is something inherent in the “together” method which
causes the pupils to spell better on immediate recall than pupils who
studied by the “separate” method; and (2) that there is something
inherent in the “separate” method which enables pupils to retain their
spelling ability better during a delayed recall interval than do the
pupils who study by the “together” method.

²³ Pearson, op. cit.
²⁴ Finkenbinder, op. cit.

The results show that the “together” method has a greater increase in
confusions and errors than does the “separate” method during the delayed
recall interval (Table VII). This fact indicates that the passage of
time adversely affects the pattern of a homonym cluster when taught by
the “together” method more than it does when taught by the “separate”
method. It appears that the learning of two or more homonyms at the same
time (together method) sets up patterns in the nervous system of which
an important phase is the contrasts. These contrasts are made up of word
pictures, meaning, spelling, use in sentences, etc., and are tied
together by sound and time sequence. Such contrasts appear to facilitate
immediate learning as is illustrated by such words as fate-fete,
canvas-canvass, their-there. The distinctions are relatively easy to
remember. With a lapse of time, however, these contrasts fade,
forgetting takes place, and the differences in meaning or use become
shadowy or even transpose themselves, resulting in an increase in
confusions as well as in errors.

On the other hand, the “separate” method sets up a pattern in the
nervous system centered around one word with its sound, meaning, use,
spelling, etc., all forming radiations of the pattern. When another
member of the cluster is studied it may or may not be tied to the first
in the pattern. Frequently it is difficult to bring the members of a
cluster together in meaning, as use so dominates the patterns of sound
that the right form appears upon stimulus with no confusions resulting.
The distinctions between members of a cluster are not stressed as such.
Since the patterns are more isolated, the sound may call up only one
response, thus causing the pupil to use the same spelling whenever that
sound appears. This explains how confusions may occur, and it may also
account for the finding that the “separate” method has more confusions
on the immediate and delayed recall tests than has the “together”
method.

In the “separate” method the lapse of time affects all the patterns of
the homonym cluster, the patterns of each homonym fading rather than the
differences between them fading and causing them to be confused. This
offers a reasonable explanation of why the “separate” method has fewer
new confusions than the “together” method on the delayed recall tests.

In the “separate” method the pattern has opportunity for additional
enrichment. In comparison, the pattern for a cluster of homonyms set up
by the “together” method is centered around the cluster and is largely
based on the contrasts between the homonyms. Opportunities for
strengthening these contrasts by incidental learning are few as the
homonyms appear in sentences separately. The net effect, then, is for
the incidental use of these homonyms in their reading, language,
writing, and other subjects to strengthen the “separate” method more
than the “together”.

This appears to be a reasonable explanation of the fact that the
“separate” method had over three times as many homonyms showing gains in
spelling accuracy during the delayed recall interval as did the
“together” method.

The “separate” method, then, appears to be a better method for the
teaching of homonyms for permanent recall because it depends upon an
enriched individual pattern for each word rather than upon contrasts
between the members of a cluster.

Many of the homonyms appear in the child’s vocabulary before he starts
to school. The best method for the teaching of these homonyms will
probably have to be determined for each homonym. This study indicates
the method which gives better results for a number of them.

The spelling of those homonyms appearing in the child’s vocabulary after
he starts to school is probably best taught when they appear. This
method would enable some of the homonyms to appear in lessons several
grades apart, which would seem desirable as it would enable the pupil to
learn one of the homonyms, and after its spelling and use are well
fixed, to learn the other one.

The slightly better results found in this study for the “separate”
method came as a result of teaching the homonyms in lessons with
approximately one week intervening between the study of one homonym and
the appearance of the other member of a cluster. It is the opinion of
the author that if the experiment could have been set up so that a
longer time, for example, one semester, had intervened between the study
of the homonyms, the results would have shown that the “separate” method
had fewer confusions and larger differences in spelling accuracy in its
favor.

The possibility should not be overlooked that a combination of the two
methods, “separate” and “together”, might give better results than
either alone. At least, it appears that if the homonyms are introduced
separately and confusions appear, then they may be brought together with
profit, to contrast their differences and to set the pupil right. This
phase of the problem of homonym instruction might profitably be
investigated further.

On the other hand, if the spelling of these homonyms is to become a
fixed habit, so that when one needs to use them no doubt will arise as
to which homonym to spell, the best way psychologically seems to be to
fix them through use in the setting in which they will be called for.
This would suggest the “separate” method.


CONCLUSIONS 


Based on the evidence for the specific 
homonyms included in this study the follow- 
ing conclusions are believed to be warranted. 

1. The “together” method results in 
greater immediate spelling accuracy than 
does the “separate” method. In the third 
grade the difference was not statistically sig- 
nificant (critical ratio 2.82) but in the sixth 
grade the difference was statistically signifi- 
cant (critical ratio 9.57). This probably 
means that the pupils who studied these 
homonyms by the “together” method, which 
encouraged contrasts between them, were 
able to acquire the right spelling and hold it 
for a short time better than could the pupils 
taught by the “separate” method, which en- 
couraged them to learn each word by itself. 

2. A consistent advantage, although not statistically significant, was
found in the third grade (critical ratio .22) and in the sixth grade
(critical ratio 2.32) for the “separate” method on the delayed recall
tests. The author believes that the final measure of spelling accuracy
is permanent recall, and for that reason gives greater weight to the
evidence on delayed recall than to that on immediate recall. Without
statistically significant evidence, a final decision as to the method
better for delayed recall should be withheld. Although not statistically
significant, the weight of evidence now available on the teaching of
homonyms for permanent recall indicates that the “separate” method on
the average is better than the “together” method. Exceptions must be
made in the cases of certain homonyms since they give better results
when taught by the “together” method.

The pupils who learn the homonyms by the “together” method hold them for
a time by the contrasts between them; after a period of time elapses,
these contrasts become indistinct and hazy and forgetting occurs. The
pupils who learn the homonyms by the “separate” method hold them by
patterns which are centered around the homonym. The patterns fade as a
whole, thus causing forgetting, but not so much as results from the
“together” method.


3. The “separate” method has enough 
more confusions than the “together” method 
on the immediate recall tests in the third and 
sixth grades that the difference between them 
is statistically significant. On the delayed 
recall test, the difference in the number of 
confusions, although not significant, shows 
that the “separate” method has more confu- 
sions in the third grade and the “together” 
method has more in the sixth grade. 


4. A small and consistent difference in favor of the “together” method,
although statistically insignificant, was found in both grades for the
highest and lowest ten per cent of the pupils on the immediate recall
test. On the delayed recall test, the highest ten per cent of the third
grade pupils did equally well by both methods, and the lowest ten per
cent of the pupils did slightly better by the “together” method. In the
sixth grade the highest and lowest ten per cent of the pupils had a
small advantage, not statistically significant, for the “separate”
method.


5. Confusions account for fifty per cent of 
the errors found in approximately one third 
of the homonyms taught in the third grade 
and in one half of those taught in the sixth 
grade. Another homonym of the cluster is 
the most frequent form of error found for 
approximately three fourths of the homonyms 
taught in third grade and in four fifths of 
those taught in sixth grade. 















6. The superiority in maintenance of spelling accuracy found for the
“separate” method is accounted for by the smaller increase in errors
which occurred during the delayed recall interval. This smaller increase
in errors is partly accounted for in the third grade by the
disappearance of confusions already present, and in the sixth grade by
fewer confusions developing during the delayed recall interval.

7. Gains in spelling accuracy during the delayed recall interval are
recorded for nineteen homonyms in third grade and thirty in sixth grade.
Of these homonyms more than three times as many were taught by the
“separate” method as by the “together”.

8. Of the sixty homonyms taught in the 
third grade, an eleven to twenty-six per cent 
advantage for one member of a cluster was 
found for teaching twenty-two of them by 
the “separate” method and twelve by the 
“together” method. Of the group which 
showed from five to ten per cent advantage, 
ten gave better results by the “separate” and 
twelve by the “together” method. The re- 
maining four showed little advantage for 
either method. 

The third grade homonyms which showed strongest evidence for the
“separate” method are:

close      knew       meat       right
clothes    new        meet       write
                      mete

grate      knights    pair       wares
great      nights     pare       wears
                      pear

heard      knows
herd       nose


The third grade homonyms which showed strongest evidence for the
“together” method are:

aisle      hours      to
I'll       ours       too
isle                  two

ant        hears
aunt       here’s


Of the eighty-six homonyms in the sixth grade, an eleven to twenty-six
per cent advantage for one member of the cluster was found for teaching
fourteen of them by the “separate” method and seven by the “together”
method. Of the group which showed from five to ten per cent advantage
for the cluster, thirty-seven of them showed better results by the
“separate” method and two by the “together” method. The remaining
twenty-six showed little advantage for either method.

The sixth grade homonyms which showed strongest evidence for the
“separate” method are:

accepted    canvas     companies    stationary
excepted    canvass    company’s    stationery

adieu       choir      fate
ado         quire      fete


The sixth grade homonyms which showed strongest evidence for the
“together” method are:

affect      council    their
effect      counsel    there
                       they’re
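
The counts stated in conclusion 8 can be tallied to confirm that each grade's categories account for every homonym taught. This is only a bookkeeping sketch; the dictionary layout and function names are illustrative choices, not part of the study.

```python
# Homonym counts by size of advantage and favored method (conclusion 8).
third_grade = {
    ("11-26 per cent", "separate"): 22,
    ("11-26 per cent", "together"): 12,
    ("5-10 per cent", "separate"): 10,
    ("5-10 per cent", "together"): 12,
    ("little advantage", "either"): 4,
}
sixth_grade = {
    ("11-26 per cent", "separate"): 14,
    ("11-26 per cent", "together"): 7,
    ("5-10 per cent", "separate"): 37,
    ("5-10 per cent", "together"): 2,
    ("little advantage", "either"): 26,
}


def total(counts: dict) -> int:
    """Total number of homonyms across all categories."""
    return sum(counts.values())


def favoring(counts: dict, method: str) -> int:
    """Number of homonyms showing any stated advantage for one method."""
    return sum(n for (_, m), n in counts.items() if m == method)


for grade, counts in (("third", third_grade), ("sixth", sixth_grade)):
    print(grade, total(counts), favoring(counts, "separate"), favoring(counts, "together"))
```

The totals come to sixty and eighty-six, matching the numbers of homonyms reported for the third and sixth grades.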


The author has been unable to develop 
from the data available any satisfactory 
hypothesis which would explain the fact that 
certain homonyms show higher spelling accu- 
racies when taught by the “separate” method, 
while others have higher accuracies when 
taught by the “together” method. There is 
perhaps a common element or principle 
which, if known, would explain the behavior 
of these homonyms. Two possible approaches 
for its discovery would be: (1) to repeat 
such a study as this with a different set of 
homonyms and by comparison with the re- 
sults of this study attempt to find a common 
element between them; (2) to follow by 
means of case studies the experiences of the 
pupils in learning to spell the homonyms by 
each method. 

Finally, the observation should be made that the pupils learned to spell
by both methods, and that such average differences as existed between
the methods were small.


A COMPARISON OF FOUR TYPES OF SPELLING TESTS
FOR DIAGNOSTIC PURPOSES


Joe E. Moore
George Peabody College for Teachers


The spelling ability of children is determined ordinarily by tests of
some kind. The basic assumption behind most of these diagnostic tests is
that they will determine whether or not a child can spell a word in a
specific situation. In turn, the way a spelling test predicts a pupil’s
ability to spell a word depends upon two factors: the reliability of the
test as a diagnostic instrument and the variability of the subject
taking the test. Obviously, then, it is important to know what type of
test should be applied in order to obtain the most reliable measure of a
pupil’s misspelling.

The purpose of this study was to investigate the variability of sixth
grade pupils on four kinds of spelling tests. The test words and the
procedure used in this study were taken from a previous study by
Northby,¹ who investigated five types of spelling tests. The oral form
used by Northby was omitted in the present study.


The subjects consisted of 102 high-sixth 
grade pupils from three public schools in 
Nashville, Tennessee: 38 pupils from the 
Tarbox School, 40 from Jones School, and 
24 from Clemons School. The entire high- 
sixth grade enrollment, including both boys 
and girls, in each of the schools investigated 
was used in this study. 


The twenty words used in the list presented in this study were selected
by Northby from the Iowa Spelling Scales developed and published by
E. J. Ashbaugh.² With respect to these words Northby says,
“Approximately half of the twenty words were of fifty per cent
difficulty for average sixth graders, one-fourth of greater than fifty
per cent difficulty, and one-fourth less than fifty per cent
difficulty.”³ The list of words used by Northby, together with the per
cent of average sixth grade pupils spelling them correctly, is given
below:

ninety (54%)          orchestra (55%)
succeeded (54%)       occasionally (33%)
similar (54%)         convenience (23%)
nickel (51%)          exhibition (23%)
admission (50%)       schedule (9%)
carnival (50%)        arriving (63%)
appearance (51%)      various (59%)
excitement (50%)      humor (78%)
planned (45%)         Saturday (87%)
finally (44%)         island (93%)

¹ Arwood S. Northby, “A Comparison of Five Types of Spelling Tests for
Diagnostic Purposes,” Journal of Educational Research, XXIX (January,
1936), 339-46.
² E. J. Ashbaugh, Iowa Spelling Scales. Bloomington, Illinois: Public
School Publishing Co., 1922.
³ Op. cit., p. 341.

Since the tests employed in this analysis were the same that Northby
used in his study, it is obvious that the present investigation follows
the earlier study in purpose. Both seek to determine whether there is
any basis for anticipating a significant difference in favor of any one
of these four most widely used forms of presenting spelling materials.
Probably the notable difference in the two investigations is that,
whereas 102 children were included in the present study, Northby made
his comparisons with only 43 cases.

A brief description of the four tests, as developed by Northby, is as
follows:

1. Story form: As the title suggests, the twenty words were worked into
a short story which follows:

    All week we had planned to go to the carnival on Saturday. It
    seemed that the day would never come, but finally it did. Right on
    schedule that morning about ninety of us succeeded in crowding into
    the cars that had been provided for our convenience to take us to
    the island.

    Everyone was in a good humor. Upon arriving there we found various
    shows and many fine exhibitions to attract attention. For some of
    these the admission price was a nickel, but occasionally we found
    some that were free. The appearance of several clowns added greatly
    to the excitement. No two were similar.

    There was much music, but we liked one orchestra better than the
    others.

2. Time dictation tests: Though the time dictation tests used in this
investigation were also the same as those used by Northby, the method of
presenting them differed slightly due to the conditions under which the
present study was made. The tests used were given to the subjects in a
large study hall where 75 or more other pupils were preparing their
lessons and not taking the spelling test. As a result it was difficult
to make the sentences understood without repeating them. Each sentence
in the dictation was pronounced twice rather than once (as in Northby’s
study), and the time limits given herewith were checked on a stop watch.
As soon as the signal stop was given the next sentence was immediately
pronounced twice, and the time limit determined for it was observed. It
will be seen that according to this method the pupils had a longer pause
than would be the case in the continuous dictation observed in the
formal time-dictation sentences, but the actual time used in writing, as
determined by Brueckner and Melby⁴ and adopted by Northby, is the same.


The twelve sentences used in this test, with the time limit for each
sentence, are as follows:

                                                     Number of   Time in
                                                     letters     seconds
A nickel was the admission price                        27          25
On Saturday we planned a trip                           24          2?
The island was ninety miles wide                        27          25
The exhibition finally began                            25          23
The orchestra played various kinds of music             37          34
We saw the schedule of arriving trains                  32          30
The carnival was similar to many others                 33          31
Bad humor was occasionally shown                        28          26
John succeeded in winning the prize                     30          28
His appearance caused much laughter                     31          29
The excitement was great                                21          20
It was planned for the convenience of the people        40          37

3. List form: The list of words used in this section of the test was the
same as that previously given. The method of giving the test was the
same as that used by Northby: each word was pronounced twice before the
pupils wrote.

4. Multiple choice: The multiple choice form was the same as that used
by Northby. A mimeographed sheet containing the four most common
misspellings of each word was included with the correct spelling. The
pupil had five possible choices for each word. The subject responded by
making a mark under the word that he felt was the correct spelling for
the one represented in each line. The following material will enable the
reader to get a clear idea of what the multiple choice section was like:

nickle, nickel, nicle, nickele, nichle

humer, humor, humur, humar, hummor

⁴ Leo J. Brueckner and Ernest O. Melby, Diagnostic and Remedial
Teaching, pp. 385-87. Boston: Houghton Mifflin


Procedure

The four tests were administered one after the other in the order here
presented, on the same day, approximately thirty-five minutes being
required to take the entire set of tests. The writer administered all
tests in order to hold constant the factor of pronunciation.


Results

Table I shows that when the criterion of difficulty is applied, the
story form is perhaps the best measure of a pupil’s spelling ability.
The difference in the story form over timed dictation is slight.

The differences between the means were checked to see if they were
significant by dividing each difference by its Standard Error of the
Difference. The differences between the means, and differences divided
by the Standard Error of the Differences are presented in Table II.


TABLE I

MEAN SCORES AND STANDARD DEVIATIONS ON FOUR TYPES OF SPELLING TESTS

                    Mean      Standard     Northby’s
Type of Test        Score     Deviation    Data
Story Form           6.62       3.52         8.51
Timed Dictation      7.24       3.70         8.32
List Form            7.90       3.50        10.02
Multiple Choice     11.36       3.46        13.83


TABLE II

DIFFERENCES BETWEEN THE MEANS OF THE FOUR SPELLING TESTS TOGETHER WITH
THE STANDARD ERRORS OF THE DIFFERENCES BETWEEN MEANS, CORRELATED
MEASURE, AND THE RATIOS BETWEEN THE TWO*

                                       Difference   S. D. of   Difference ÷
Type of Test                           M₁ − M₂      Diff.      S. D. of Difference
Multiple Choice and Story Form           4.74         .49          9.67
Multiple Choice and List Form            3.46         .49          7.06
Multiple Choice and Timed Dictation      4.12         .50          8.21
Story Form and List Form                 1.28         .50          2.58
Story Form and Timed Dictation            .62         .51          1.22
List Form and Timed Dictation             .66         .51          1.29

* In the calculation of the means and Standard Deviation a step interval
of 2 was used.

Whenever a difference between two means is equal to or greater than
three times its Standard Error of the Difference, this difference is
considered to be statistically significant. In only three of the six
instances is the difference between the several means equal to or
greater than three times its Standard Error. In Northby’s study, to the
contrary, the differences obtained were reliable in all except one
instance, that being the difference between the story form and the timed
dictation form. Incidentally, it will be seen from Table II that in the
present study the difference between the story form and the timed
dictation form is less reliable than any of the others. There is another
marked difference in the results obtained in this study and those
obtained by Northby. Northby found a statistically reliable difference
between the list form and the timed dictation. The differences obtained
in this study between the story form and the list form, and between the
list form and timed dictation were not reliable.
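
The significance rule described above can be sketched in a few lines. The means are taken from Table I and the standard errors from Table II; the timed-dictation mean is read here as 7.24, which is an assumption based on the differences shown in Table II, and all function and variable names are illustrative. Because the inputs are rounded, a recomputed ratio may differ from the printed one in the second decimal place.

```python
from itertools import combinations

# Mean scores from Table I (timed-dictation mean read as 7.24: an assumption).
means = {
    "Story Form": 6.62,
    "Timed Dictation": 7.24,
    "List Form": 7.90,
    "Multiple Choice": 11.36,
}

# Standard errors of the differences, as printed in Table II.
standard_errors = {
    frozenset({"Multiple Choice", "Story Form"}): 0.49,
    frozenset({"Multiple Choice", "List Form"}): 0.49,
    frozenset({"Multiple Choice", "Timed Dictation"}): 0.50,
    frozenset({"Story Form", "List Form"}): 0.50,
    frozenset({"Story Form", "Timed Dictation"}): 0.51,
    frozenset({"List Form", "Timed Dictation"}): 0.51,
}

THRESHOLD = 3.0  # a difference counts as significant at three standard errors


def critical_ratio(mean_a: float, mean_b: float, se_diff: float) -> float:
    """Difference between two means divided by its standard error."""
    return abs(mean_a - mean_b) / se_diff


results = {}
for a, b in combinations(means, 2):
    ratio = critical_ratio(means[a], means[b], standard_errors[frozenset({a, b})])
    results[(a, b)] = (round(ratio, 2), ratio >= THRESHOLD)

significant = [pair for pair, (_, is_sig) in results.items() if is_sig]

for pair, (ratio, is_sig) in results.items():
    print(pair, ratio, "significant" if is_sig else "not significant")
```

With these inputs, three of the six comparisons reach the threshold, and each of the three involves the multiple choice form.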

The percentage of the 102 pupils spelling 
each word correctly, tabulated according to 
the type of test, is given in Table III. The 
results obtained by Ashbaugh from Iowa 
pupils in the sixth grade are given at the 
right under the column labeled “Iowa 
Standards.” 


Since Ashbaugh’s list was given by the list form method, it should be
compared with that method as used in the present study. It will be seen
from Table III that there is an appreciable difference between
Ashbaugh’s results and the results obtained in the present study when
the list form was used. In only four words of the twenty did the present
sample of Tennessee children exceed the Iowa group in correct spelling.
It is impossible to account for the difference unequivocally. However,
it may have resulted from a difference in textbooks, the amount of
emphasis placed upon spelling, divergent methods of teaching spelling,
difference in the time of the school year when the two experiments were
conducted, variations of conditions under which the studies were made,
or from combinations of these factors.


TABLE III
PERCENTAGE OF PUPILS SPELLING CORRECTLY THE TWENTY WORDS IN THE FOUR TESTS

Columns: Word; Story Form; Timed Dictation; List; Multiple Choice; Iowa
Standards*

Words, in row order: ninety, succeeded, similar, nickel, admission,
carnival, appearance, excitement, planned, finally, occasionally,
convenience, orchestra, exhibition, schedule, arriving, various, humor,
Saturday, island.

[The percentage entries of this table are illegible in this copy.]

* Determined by list form method.

In the correct spelling of the twenty words 
the story form has the lowest percentage of 
correct spelling in fourteen instances. The 
multiple choice form has the highest percent- 
age of correct spelling of the four tests given. 
In this connection it should be remembered 
that the multiple choice form was the only 
recognition test given. Of the three recall 
tests the list form shows the highest percent- 
age of correct spelling. 

The total number of different words spelled by all the subjects was
2040. Since these words were spelled in four different situations, the
total number of spelling attempts by all pupils was 8160. Of these, 4990
words were misspelled. The number of misspellings, classified as to
forms, is presented in Table IV according to type of test.


TABLE IV

NUMBER OF TIMES THE WORDS WERE MISSPELLED ON EACH OF THE FOUR FORMS

Story     Timed        List     Multiple    Total
Form      Dictation    Form     Choice      Errors
1415      1359         1289     927         4990
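
The tallies behind this paragraph and Table IV can be recomputed directly from the number of pupils, the number of words, and the four per-form error counts. The sketch below is only a bookkeeping check; the variable names and the derived error rates are illustrative and do not appear in the study.

```python
PUPILS = 102  # high-sixth grade pupils tested
WORDS = 20    # words on each form of the test

# Misspellings recorded on each form (Table IV).
errors_by_form = {
    "Story Form": 1415,
    "Timed Dictation": 1359,
    "List Form": 1289,
    "Multiple Choice": 927,
}

words_per_form = PUPILS * WORDS                         # words written on one form
total_attempts = words_per_form * len(errors_by_form)   # across all four forms
total_errors = sum(errors_by_form.values())

# Per cent of attempts misspelled on each form.
error_rate = {
    form: round(100 * count / words_per_form, 1)
    for form, count in errors_by_form.items()
}

print(words_per_form, total_attempts, total_errors)
print(error_rate)
```

The derived rates make the ordering of the four forms easier to compare, since each form was attempted the same number of times.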


The data indicate that among these four types of tests pupils made most
errors on the story form of spelling test and fewest errors on the
multiple choice form. However, since the trend in reduction of errors is
consistent throughout the four tests in the order given to the subjects,
there may be some indications that there was a practice effect operating
throughout the series. Northby’s results⁵ show a similar trend in
reduction of errors with the exception of the timed dictation form,
which he found slightly higher in errors than the story form. It is
generally well known that the recognition type of test seems to be much
easier than the recall type. This fact may explain, at least partially,
the marked decrease of errors in this type of test as shown in Table IV.

⁵ Op. cit., p. 344.


Conclusion

There is a marked variability of pupils on the several tests. Some of
the words found difficult on one form of test seem easier when presented
in another form. The story and the timed dictation types of tests appear
most difficult. The multiple choice form is easiest for the majority of
students. From the results on the multiple choice form of the test it
would seem that this test is either sampling an ability that none of the
other tests is measuring, or else shows the effect of practice on the
test results. From the previous findings in the field of educational
research it would appear that perhaps the former possibility is the more
valid assumption to apply here. Further research should be done with
this type of spelling test.

If the difficulty of the test is to be used as 
a basis for its validity, then the story form 
and timed dictation form would be the best 
for diagnostic testing. The variability
(Standard Deviation) around the mean of the 
story form and of the timed dictation form 
is about the same, being 3.54 in the former 
case and 3.70 in the latter. The Standard 
Deviation around the mean of the timed dic- 
tation form is greater than the variability on 
any of the other tests. The normality of the 
data in this study is evidenced by the un- 
usual uniformity of both the means and the 
Standard Deviation of the means. When it 
is borne in mind that the pupils represented 
in this study came from three sections of the 
city of Nashville, Tennessee, representing a 
fair cross-section of the city, the normality 
seems even more significant. 

Other than the multiple choice form the 
list form has the highest average and the 
smallest Standard Deviation for correct spell- 
ings. If a pupil could spell at all it would 
seem that his performance could be measured 
with greater reliability on the list form than 
on any of the others. Since the differences 
between the list form, the story form, and the 
timed dictation tests are not reliable, the list 
form could be used with assurance that it 
would be as accurate a measure of the pos- 
sible errors as any of the other tests given. 
However, it should be pointed out that most 
of the pupils have been using the list form in 
spelling, which fact may have something to 
do with the consistency of the measures of 
central tendency and variability. 

These data in general verify the conclu- 
sions of Northby who made use of the forms 
used in this study as well as an additional 
oral test. The superiority of Northby’s group, 
as represented by mean scores, over the group 
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in this study would tend to indicate the 
presence of some selective factor or factors. 
These data indicate that different spelling 
tests tend to test different types of ability. 
A pupil may be able to recognize the correct 
spelling on the multiple choice form but not 
be able to spell it on any of the three recall 
tests. Furthermore, the results of this study 
seem to show that the situation in which a 
pupil uses a word will make an important
difference in his ability to spell it correctly. 
The findings in this investigation should be 
suggestive for further research. One question
that arises from this study is
whether or not there is a statistically reliable
difference between the timed dictation test 
used in this study and the one used by 
Northby. A similar test should be constructed 
which would parallel the particular course of 
study in each state to see if the data support 
the conclusions of this study and of Northby’s 
investigation. The fact that pupils do so 
much better on the multiple choice type of 
test might cause a marked influence on the 
other forms, should it be given first. The 
order of presenting the different forms of the 
test might cause a marked difference in 
results. 


















A study of the status of proficiency in
English attained by students attending insti- 
tutions educating teachers seems quite im- 
perative at this time in order that a begin- 
ning point may be found from which to un- 
dertake obviously needed improvements in 
the programs of English teaching offered by 
these institutions. It is a matter of rather 
general knowledge that the status reached in 
English compositional skills is one of low pro- 
ficiency for the majority of college freshmen. 
If the teacher-training institutions of the 
country would make a concerted attack upon 
raising this status for the students whom they 
are to send out to be teachers in the elemen- 
tary and secondary schools, much could be 
done to raise the level of proficiency in Eng- 
lish skills throughout this country. 

A somewhat expository statement of the 
problem of this entire study or investigation 
is this: a study of the status in basic skills 
in written English attained by freshmen in
three teachers colleges, together with sugges-
tions growing out of the implications of that
study for certain revisions in the course in 
English composition on the college level. 

A statement of the problem in terms of 
some of the specific objectives is given in 
order to define in a more specific way the 
delimitations accepted. 

*1. To carry out a satisfactory testing 
program in order to discover the status of
proficiency in certain basic skills in written
English attained by all groups in the 
experiment 


† An abstract of part of a thesis submitted in partial ful-
fillment of the requirements for the degree of Doctor of Phil-
osophy, Department of Education, Graduate College of the
State University of Iowa, 1937.

The present study had its origin in a suggestion given by
Dr. Ernest Horn. To him and to Dr. Harry A. Greene, the
writer of this desires to express sincere gratitude for
their guidance in directing the study.

The writer is grateful to the several institutions which co-
operated so generously in carrying on this investigation.

To others, the writer is indebted for generously giving of
their time and services: Dr. Helen K. Mackintosh, Mrs.
James McKenzie, Dr. Luella Wright, Dr. W. R. Lueck, Dr.
. F. Carpenter, and Dr. Walter W. Cook.

* The present abstract deals with phases of the objectives
starred.
2. To study compositions, tests, and
other material written by the students 
tested 

*3. To make a detailed inventory of the 
results of the testing program and of the 
composition materials selected for that 
purpose 

*4. To analyze the results of the tests
and to analyze the compositions for types
of errors in basic skills.

*5. To interpret the results of the pro- 
gram of testing and composition writing 

6. To evaluate the results of the entire 
program 

7. To develop from the inventory taken, 
techniques useful in making an inventory 
of the needs of college freshmen in basic 
skills in English 

8. To carry on, at one of the colleges, 
during the school year 1931-1932, a reme- 
dial program with the group of about 
twenty students ranking lowest on the in- 
ventory tests of basic skills in written 
English 

9. To develop from the work with the 
remedial group procedures which should be 
effective with any group of freshmen weak 
in basic skills in written English 

10. To evaluate the remedial program 

*11. To work out suggestions from the 
evaluations of the testing program and the 
remedial program of teaching for certain 
revisions in the course in English composi- 
tion for teachers colleges 


The present study is made in an attempt 
to help raise the standards of training in the 
basic skills in written English to a higher 
level of proficiency. The data collected in 
this study yield indications in favor of a pro- 
gram of self-instruction in English composi- 
tion on the college level. 

This study of status was carried on in 
three teachers colleges in two states of the 
Middle West. The three colleges will be 
designated in the thesis as College A, Col-
lege B, and College C, respectively. The
group of sixteen freshmen from College C 
chosen for special study will be called the 
Remedial Group. As a point of departure 
for the study of status, a summary of cer- 
tain points of general information about the 
three institutions concerned is presented in 
Table I. 


TABLE I

NUMBER OF STUDENTS TESTED AT THE
THREE COLLEGES

College                 Year        Group                        Number of Cases
                                                                 in the Experiment

A (Four-year school)    1931-1932   Freshmen                      54
                                    Sophomores                    31
                                    Juniors                       29
                                    Seniors                       35
B (Four-year school)    1931-1932   Freshmen                      62
                                    Sophomores                    14
                                    Juniors                       10
                                    Seniors                        8
C (Two-year school)     1931-1932   Freshmen (General Group)     109
                                    Freshmen (Remedial Group)     16
                                    Sophomores                    24
C                       1934-1935   Freshmen                     109
                                    Sophomores                    24

Totals                  1931-1932   Freshmen                     241
                                    Sophomores                    69
                                    Juniors                       39
                                    Seniors                       43
                        1931-1932   Total                        392
                        1934-1935   Total                        133

Grand Total                                                      525


In this study, conducted along the lines of 
a combination of the experimental, compara- 
tive, and survey techniques, College C car- 
ried on the most extensive program. The 
other colleges carried on less extensive pro- 
grams of testing, useful mainly as checks on 
the program conducted at College C. 

The study was begun in the Fall of 1931. 
During the school year of 1931-1932, Fall 
and Spring testing programs in basic skills 
in written English were conducted at the 
three colleges named. In 1933, an extension 
of the study was planned for the school year 
1934-1935 at College C. It was believed 
that by carrying on a similar testing program 
in 1934-1935, some comparisons might be 
drawn between the scores made at College C 
in that year and those made there in 1931-1932.
The tests chosen for the main experiment 
were these: 
Kuhlmann-Anderson Intelligence Tests,
Grade IX-Maturity
Otis Self-Administering Tests of Mental
Ability, Form A
Iowa Placement Examinations of English-
Aptitude, Series 1, Revised, Forms A
and B
Iowa Placement Examinations of English-
Training, New Series, Forms X and Y
O’Rourke Grammar Survey Tests, Forms
A and B, Tests 1, 2, and 3
Cook Spelling Test, Forms A and B¹


A phase of the program of 1931-1932 was 
projected into 1934-1935, at College C. 
During that year the Iowa Placement Exam-
inations of English-Training were given to
109 freshmen in the Spring of 1935. 

Compositions were collected for analysis. 
The checking of the compositions was done 
by the writer with very careful reference to 
the Criterion.² The compositions used had
already been checked by the regular teachers 
of English. More errors were located by 
means of the checking done with the Cri- 
terion than had been found by means of the 
checking done without it. The study of status 
resolved itself into a study of the results of 
the testing survey, and of the analysis of 
compositions. A suggested program grew 
out of the interpretation of these results. 

The data from the 1931-1932 testing pro- 
gram are summarized in Table II, the basic 
or key-table in this study. 

This table reports the findings on status of 
proficiency in written English revealed by the 
test scores made on the O’Rourke Grammar 
Survey Tests, Forms A and B,³ taken by the
four groups from the three colleges; on the
Iowa Placement Examinations of English-
Training, Forms X and Y, taken by the Gen-
eral Group and by the Remedial Group at
College C; and on the Cook Spelling Test,
Forms A and B,⁴ taken by the four groups


¹ Walter W. Cook, “The Measurement of General Spelling
Ability Involving Controlled Comparisons Between Tech-
niques.” Unpublished Doctor’s thesis. Iowa City, Iowa:
State University of Iowa, 1932.

² Harry A. Greene, A Criterion for the Course of Study in
the Mechanics of Written Composition. University of Iowa
Studies, New Series, Number 4. Iowa City, Iowa: State
University of Iowa, 1933.

³ L. J. O'Rourke, O’Rourke Survey Tests of English Usage,
Forms A and B. Washington, D. C.: Educational and Per-
sonnel Publishing Company, 1930.

⁴ W. W. Cook, op. cit.
at the three colleges. One form of each test
was given in the Fall, and another, in the
Spring. The coefficients of correlation given
in column 9 of the table indicate that very
little gain had been made from the Fall test 
to the Spring test by the general groups. On 
the other hand, the low correlations for the 
Remedial Group indicate quite pronounced 
gains for that group from Fall to Spring tests. 
A glance at this column will show that the 
reliability was high in total score averages 
for all but the Remedial Group on the Iowa
Placement Examinations. The small number 
of cases and the unusually large amount of 
gain may account, in part, for the low corre- 
lation for the total score averages for this 
group. The negative correlation on Part 2 
of the same test for this group has the effect 
of lowering the correlation on the total. 

The calculated sigma of the correlation and 
the sigma of the average, in almost every 
case, indicate that the data are reliable 
enough to be used as bases for drawing im- 
plications and conclusions. In the case of 
the Iowa Placement Examinations taken by
the Remedial Group, the small number of 
students (sixteen) makes conclusions on the 
results for the group as a whole less reliable 
than those for the larger groups. However, 
the group will be studied both for status as 
a whole, and by cases, in order to provide a 
check on the study of status for the larger 
groups. Chapter V of the thesis gives a de- 
tailed report on the status of the Remedial 
Group. 

The status for each group on all the tests 
at the beginning of the year, expressed in 
terms of the arithmetical means, is given in 
the sixth column of Table II, first part. 

In the case of the O’Rourke Tests,⁵ the
status revealed by the average scores made 
on Form A of the test, taken in the Fall, was 
below the college norm for each group. The 
status of the Remedial Group was far below 
the norm. 

On the Iowa Placement Examinations, the
General Group at College C was slightly 
above the norm at the beginning of the year, 
while the Remedial Group was far below. 

For the Cook Spelling Test,⁶ a comparison
of the means in spelling shows that the Reme- 
dial Group was not the lowest in initial sta- 
tus. The ratings are high enough for all 
groups to show that the spelling test used 


⁵ L. J. O'Rourke, op. cit.
⁶ W. W. Cook, op. cit.
was not adequate for measuring the spelling
ability of the students who were the best in 
spelling. (This is especially true of the final 
tests, on which 62 freshmen out of 234 made 
perfect scores. Many others made scores 
ranging from 47 to 49, when 50 was the per- 
fect score. The actual gains, therefore, may 
be greater than the data indicate.) 


Column 6, part 2, of the same table pre- 
sents the status on the second form of the 
test, taken at the end of the year. On the 
O’Rourke Tests,⁷ College B and the General
Group at College C exceeded the norm for 
total score; College A and the Remedial 
Group did not quite attain it. The Remedial 
Group, however, made a far greater gain 
from Fall score to Spring score than did any 
other group and stood at an average of 56.06 
points at the end of the year, while the norm 
is 67.17. The average score for the 225 
freshmen of the general groups at the end of 
the year was 63.38 as compared with the 
average score of 56.06 for the sixteen mem- 
bers of the Remedial Group. The statistical 
significance of these gains is made clear in 
Table II. 


On the Iowa Placement Examinations, the General
Group at College C had gained 22.58 points from
average to average by the end of the year, and
was well above the
norm. Seventy-two cases out of this group 
of freshmen were still below the norm in total 
score on this test. The Remedial Group had 
gained 46.69 points, but was still 20.48 points 
below the norm on general average. One out 
of the sixteen members of the Remedial 
Group had exceeded the norm. 


At College A, the gains on total score on the
O’Rourke Tests from Fall to Spring were very
small, only 1.44
points. On Parts 1 and 2 of the test, the 
gains were small and on Part 3, there was a 
loss. The proportional amount of gain was 
almost negligible. 


The average scores at the end of the year 
on the Cook Spelling Test⁸ were so high that
there seemed to be additional proof for stat- 
ing that this test had not been difficult 
enough to measure ability above that of the 
average freshman. 


At College B, the gains from Fall testing
to Spring testing were considerably greater
than those made by freshmen at
College A, both on total scores and on scores


⁷ L. J. O'Rourke, op. cit.
⁸ W. W. Cook, op. cit.
for separate parts of the O’Rourke Tests.⁹
The proportional gains were much greater. 

For College C, the same kinds of gains fell 
between those of the other two colleges for 
the General Group of freshmen. For the 
Remedial Group the point gains and the pro- 
portional gains were greater than for any of 
the groups mentioned. The gains were so 
much greater that they could not be ac- 
counted for by mere chance or by the single 
fact that the Remedial Group had twenty- 
four more class-hours in their first quarter 
than did the other students tested at Col- 
lege C. Their gains were greater than the 
gains made by the total freshman group 
(General Groups) of 225 students from the 
three colleges. 

The returns from the O’Rourke Test were 
not dissimilar from those from the Iowa
Placement Examinations. The comparison
of the returns follows: The Iowa Placement
Examinations were given at the home school 
only, making comparisons between the Gen- 
eral Group of College C and the Remedial 
Group the only ones possible. The gains for 
the achievement on Iowa Placement Exam-
inations of English-Training were greater on
total score and on the separate parts of the 
test for the Remedial Group than for the 
General Group in terms of points. In terms 
of proportional gains, the Remedial Group 
exceeded the General Group on all but Part 4 
of the test. On Part 4, the General Group 
had the slight advantage of .038 over the 
Remedial Group. (The proportional gain 
was found by dividing the actual gain made 
from Fall score to Spring score by the pos- 
sible gain. The possible gain is that which 
could have been made from the standing on 
the Fall test to a perfect score.) 
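
The proportional gain defined above can be sketched in a few lines of Python. This is only an illustration of the stated formula, not the writer's own computation: the function name `proportional_gain` and the Fall and Spring means used below are hypothetical, and only the perfect score of 50 (that of the Cook Spelling Test) is taken from this study.

```python
def proportional_gain(fall_score: float, spring_score: float, perfect_score: float) -> float:
    """Actual gain divided by possible gain, following the definition in the text."""
    # Possible gain: distance from the Fall standing to a perfect score.
    possible_gain = perfect_score - fall_score
    if possible_gain <= 0:
        raise ValueError("Fall score is already at or above the perfect score.")
    # Actual gain: Spring score minus Fall score (negative if there was a loss).
    actual_gain = spring_score - fall_score
    return actual_gain / possible_gain


# Hypothetical Fall and Spring means on a 50-point test (illustrative only).
example = proportional_gain(fall_score=40.0, spring_score=45.0, perfect_score=50.0)
print(round(example, 3))
```

In this hypothetical case a group that moves from 40 to 45 on a 50-point test has realized half of its possible gain. Expressing gains this way allows groups with very different Fall standings to be compared on the same footing.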

The one exception to the greater gains of the
Remedial Group is that of the spelling
gains made by College B, which were 4.69 
points higher than those for the Remedial 
Group. This may be accounted for, in part, 
by the fact that the initial average for the 
Remedial Group was 1.66 points higher than 
that for the group from College B. 

On the Cook Spelling Tests,¹⁰ the greatest
gain was made by College B, the next 
greatest by College A, the third greatest by 
the Remedial Group, and the fourth greatest 
by the General Group at College C. In the 
last named case, a fair comparison cannot be 
made, however, since the fact that some 


⁹ L. J. O'Rourke, op. cit.
¹⁰ W. W. Cook, op. cit.
papers were lost from the Fall returns made
a balanced comparison of scores for the 
entire number of cases impossible. 

On the O’Rourke Tests,¹¹ the greatest
gains made by each group were those made 
on Part 2, the test on adjectives, adverbs, 
prepositions, and conjunctions. The greatest 
gains on the Iowa Placement Examinations
were made in punctuation by the General 
Group and by the Remedial Group. 


The data mentioned from this one table, 
Table II, together with the statistical calcu- 
lations included in the table to give evidence 
of reliability, indicate that the Remedial 
Group made greater actual point gains in 
almost every part of the three tests reported 
upon in this table than did any other group. 


It seems safe to conclude that the tests in- 
dicate that very significant gains have been 
made on all of the tests by the Remedial 
Group. In terms of proportional gains (col- 
umn 11), those made by the Remedial Group 
are about twice those made by all freshmen 
on the O’Rourke Tests. On the Iowa Place-
ment Examinations, the proportional gain
total for the General Group is .217, while 
that for the Remedial Group is .288. The 
difference, however, between the average 
gains in points made by the two groups is 
24.11, in favor of the Remedial Group. 

The greater gains made by the Remedial 
Group may be accounted for largely in terms 
of type of training given. The average of in- 
telligence for the Remedial Group as com- 
pared with that for the General Group at 
College C in 1931-1932 was much lower. In 
fact, the Remedial Group ranked lower than 
did the freshman group at College C in 1934-
1935. Data in regard to intelligence were
collected. 


The testing program of 1934-1935 re-
vealed a status at College C similar to that
found in the same school in 1931-1932.


Error counts were made for several sections 
of the tests. A study of errors made in com- 
position writing by the Remedial Group, and 
by thirty of the freshmen of 1934-1935, was 
worked out in order to find whether the errors 
in basic skills in written English made on 
compositions were the same as those made 
on the tests by large percentages of the stu- 
dents. The types of errors on both tests and 
compositions were similar.

¹¹ L. J. O'Rourke, op. cit.

Any study is beset by limitations, some of
which are subjective, and some, objective. 
The chief limitations in the present study 
are those usually conceded. 


The summary and the conclusions in re- 
gard to status follow. This study has tended 
to verify empirical knowledge in regard to 
the status of proficiency in the basic skills in 
written composition attained by college fresh- 
men in three teachers colleges of the Middle 
West. 

The findings in regard to status are sig- 
nificant, when it is remembered that the stu- 
dents who go out to teach with low rating on 
these tests will not be able to give adequate 
training to those whom they are to teach at 
the elementary school level. As a result, 
many more students will come to college un- 
prepared in basic skills in written composi- 
tion. 

It was found, then, from the studies of 
1931-1932 and 1934-1935 that: 


1. The status of proficiency for the fresh- 
men of 1931-1932, as measured by average 
scores on the tests used, was low at the be- 
ginning of the year for all freshmen. It was
very low for those in the Remedial Group.


2. Every group of freshmen made some
gain by the end of the year on all of the
tests, save the freshmen from College A,
who registered a small loss on Test 3 of the
O’Rourke battery.


3. The status of proficiency at the end of 
the year (1931-1932) was above the norm 
in too few of the possible cases on all the 
tests given. In general, it would have to be 
admitted that the status was somewhat low 
on all the tests, especially when it is remem- 
bered that many of the students tested were 
going out to teach the subjects in which they 
had made low scores. 


4. The status for sophomores, juniors,
and seniors of 1931-1932 was not
dissimilar from that of the freshmen, if that
conclusion may be inferred from the results
for the small number of these classes tested.
5. The greatest gains on the tests were
made by the Remedial Group. The data sup-
porting this statement and the details of the
program worked out would seem to point to
the fact that the program of training given
the Remedial Group was somewhat effective.
From this program certain procedures have
been selected as worthy of further use in an
experimental way. These procedures appear
in Chapter VII of the writer’s thesis.

6. The points in English which large per-
centages of these students missed on the tests
and in the compositions are those which
should have been taught in the elementary
school, according to Cesander’s¹² study of
grade placement.

7. The types of errors appearing most fre- 
quently on the tests are those which appeared 
most frequently in the compositions analyzed.
This fact indicates that the use of these tests
for purposes of diagnosing the needs of stu-
dents in the fields tested is a valid procedure.

8. The status of proficiency on the part
of freshmen at College C in the Spring of
1935 was about the same as that for the
freshmen of the same school in the Spring of
1932, judging from results on comparable
forms of the Iowa Placement Examinations
of English-Training. It seems somewhat evi- 
dent that the program of instruction in Eng- 
lish composition at that school has been static 
for the period of years from 1931 to 1934. 
Intelligence test scores, for both 1931 and 
1934, bear out the fact that native ability 
was about on a par for the groups of fresh- 
men tested during those two years. There- 
fore, the condition cannot be attributed to 
variations in native ability. 

In general, it may be concluded from the 
study of status made at the given institutions 
through the media of testing and composition 
analysis that only a small number of students 
are adequately prepared for the high art of
teaching the basic skills in written English. 
It is a problem which demands serious atten- 
tion and action. 


¹² Paul K. Cesander, “A Study of Pupil Usage as a Factor
in the Grade Placement of Certain Items of Punctuation.”
Vol. I. Unpublished Doctor's thesis. Iowa City, Iowa: State
University of Iowa, 1931.
THE EVALUATION OF A TECHNIQUE FOR MEASURING
IMPROVEMENT IN ORAL COMPOSITION* 


R. F. NETZER 
Arizona State Teachers College, Tempe 


During the past decade workers in the ele- 
mentary school field have pointed out that 
emphasis on the development of the skills in 
written and oral expression must coincide 
more closely with the life demands on lan- 
guage. They have shown that most of the 
out-of-school demands on language ability 
are of the oral type and that teachers spend 
too much time on the mechanical and tech- 
nical aspects of language. Perhaps the rela- 
tive ease of judging the mechanical and tech- 
nical aspects and the lack of an objective 
means of judging oral productions have led 
to this overemphasis on written language. 
Also, textbooks and courses of study have 
been much less provident of materials for 
oral language. 

Modern curricular emphasis is being ad- 
justed to provide more adequate oral lan- 
guage instruction and with this expansion 
have come certain problems in materials and 
methods. One of these problems is the dis- 
covery of a technique for measuring improve- 
ment in oral composition. A common means 
of measuring improvement in written compo- 
sition has been a scale. Whether this tech- 
nique is applicable to the measurement of im- 
provement in oral composition has not been 
determined, because until very recently no 
method of making exact reproductions of oral 
compositions was available. Now that exact 
reproductions can be made by means of re- 
cording equipment it is possible to evaluate 
the scale as a method of measuring improve- 
ment in oral composition. The purpose of 
this study was to evaluate experimentally an 
oral language scale as a means of revealing 
group and individual growth in ability to use 
oral language skills. Because no adequate 
oral language scales were available, the con- 
struction of such instruments became a neces- 
sary preliminary to the experimental phases 
of the study. 

In this discussion the term “scale” is con- 
sidered to mean a group of children’s pro- 


*Summary of a doctoral dissertation. University of Iowa, 
1029 
ductions ranging from very poor to very
good, arranged in ascending order and of such 
quality that the second is approximately as 
much better than the first as the third is 
better than the second and as the fourth is 
better than the third. 

Research has shown that rating by a gen- 
eral merit scale is much more reliable than 
rating by an analytical type of scale, that 
themes on the same topic and themes with a 
wide range of quality are easier to rate, and 
that there is a definite relationship between 
general quality rating and rating for specific 
factors. The inadequacy of written scales 
for judging oral compositions has also been 
demonstrated, since they do not show the in- 
fluence of difference in vocabulary and ex- 
pression found in oral composition. Chil- 
dren, like adults, use words and expressions 
in speaking which they do not employ in 
writing. In a previous investigation the 
writer found it impossible to use a written 
composition scale to rate oral compositions. 
Too small a difference in quality between the 
poorest and best production was revealed 
through the use of a written scale. The sen- 
tence structure and usage in oral composition 
are inferior to those of written composition. 
Some of the constructions which are consid- 
ered unpolished when they appear in written 
form, apparently add to the effectiveness of 
oral production—sentence fragments, for in- 
stance. It is also valid to say that the vocab- 
ulary of oral composition is not so limited as 
the vocabulary of written composition, since 
children and adults often use words in speak- 
ing which they do not employ in writing be- 
cause of inability to spell such words. Thus 
the standards for evaluating oral and written 
composition must differ. 

The first problem was to find satisfactory 
means of stimulating oral composition. After 
trying out a large number of procedures 
which had been suggested by teachers and 
supervisors, three, which appeared satisfac- 
tory, were selected for use. They were a pic- 
ture, an incomplete story, and an object. 
The next problem was to select suitable
materials of each of these three types. The 
results of studies pertaining to the preference 
of children for certain types of pictures do 
not agree and there have been no studies con- 
cerning the relative ability of different types 
of pictures to stimulate language expression. 
Likewise there is no reason to believe that 
the picture which is most liked will best 
stimulate expression. Beautiful pictures and 
scenes may be much liked but be beyond the 
power of anyone but an artist to express in 
words. It is reasonable to assume that the 
picture selected must contain a few signifi- 
cant details which are prominent and in the 
foreground. 

The writer has been unable to discover any 
studies concerning the type of story which 
best stimulates language expression. It seems 
reasonable that the incomplete story selected 
must tell enough to give the main idea but 
still leave something to be told, must end at 
an interesting point so that the child will be 
stimulated to complete it, and must not ex- 
actly determine the completion but leave 
more than one solution possible. 

Studies of the relative ability of objects to 
stimulate expression are lacking, but it is rea- 


sonable to require that the object selected be 
something that a child has seen or read about, 
realistic in shape and color, and perhaps 


something a child can handle. Each of the 
things selected must be within the range of 
experiences and understanding of children 
and must be something which allows for ex- 
tensive expression by those capable of ex- 
pressing themselves in greater length and 
detail. 

The three stimuli selected for each grade 
on the basis of the above criteria were pre- 
sented to groups of pupils in the fourth, fifth, 
and sixth grades of the University of Iowa 
Elementary School and the pupils' responses
were recorded by the Iowa Oral Language 
Recording Equipment. The children were 
told that the investigator was making a col- 
lection of children’s stories and that their re- 
sponses were to be recorded. The children 
were not “microphone conscious” due to the 
fact that they had seen the microphones be- 
fore and had had their responses recorded for 
other studies in previous years. When the 
hour was over many of them wanted to re- 
main longer and, when told that this was 
impossible, asked if they might return at
another time. 

The records of the 135 responses were 
transcribed and proofread. Fifty-one were 
from the fourth grade, forty-two from the 
fifth grade, and forty-two from the sixth 
grade. No effort was made to limit the num- 
ber of compositions. All that were given were 
recorded and transcribed. The compositions 
varied in length from thirty-eight to ror, 
words. 

An analysis of these compositions showed 
that in general the object used was the best 
of the three methods of stimulating oral com- 
position, the story second best, and the pic- 
ture the poorest method, although the differ- 
ences were not statistically significant. On 
the basis of these results it seemed advis- 
able to use the same three types of stimuli 
and to select one of each type for each of 
grades two and three. Those selected were 
considered suitable for the purpose by several 
teachers of grades two and three. 

Permission was obtained to record the re- 
sponses of children in four school systems in 
Iowa. These systems ranged in size from a
small town to one of the larger cities of the
state. Because the recording equipment re-
quired 110 A. C. 60 cycle current it was im-
possible to use all the school systems which
were willing to cooperate. The teachers in 
these systems were requested to select the 
five poorest and five best pupils in oral lan- 
guage in their rooms. The writer was pri- 
marily interested in obtaining samples which 
ranged from extremely good to extremely 
poor rather than in the grade level from 
which they were obtained. This range was 
necessary in order to obtain adequate spread 
for the scale. Furthermore, samples with a 
wide range of quality were found to be more 
easily and reliably rated. 

The recording equipment was taken to each 
school, the stimuli presented, and the pupils’
responses recorded in the same way as at the 
University Elementary School. The responses 
were transcribed and proofread. In this way 
249 responses were obtained which ranged 
from those of poor second graders to those 
of good sixth graders. These responses were 
combined with those obtained at the Univer- 
sity Elementary School. This group of 384 
responses was used as the source of material 
for developing the scales. 
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In deriving the quality scales an exact copy 
of each of the 384 responses was used. 
Although the compositions contained many 
incomplete sentences, sentence fragments, 
and repetitions of words and phrases, they 
were not smoothed because the exact response 
was desired for analysis. 

Because it would be nearly impossible and 
extremely unreliable to have the judges ar- 
range the total 384 responses in order of 
quality from poorest to best, it was decided 
that some preliminary elimination was neces- 
sary. This was accomplished by having each 
of five qualified judges independently arrange 
this total group of responses in ten piles, each 
pile representing a different level of quality 
from very poor to very good. Thus, those 
that each judge considered the best he placed 
in pile one, those considered poorest in pile 
ten, and those judged to be of a value be- 
tween these two extremes he placed in the 
appropriate one of the eight intervening piles. 

Those most often falling in each of the ten 
groups were selected for use in determining 
the final scales. These were consciously se- 
lected so that they would represent the total 
range of quality from very inferior to very 
superior. For example, samples 31 and 47 
were selected as most representative of the 
third group because three out of five judges 
placed sample 31 in group three and two 
placed it in group two, and thus it probably 
represents a value slightly less than three; 
sample 47 was placed in group three by four 
judges and in group four by one judge, and 
thus it represents a value slightly more than 
three. In this manner twenty-five samples 
were selected so that they represented as 
nearly as possible a continuous range from 
good to poor. 
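
The selection rule described above can be sketched in a few lines. The pile placements for samples 31 and 47 are those given in the text; the use of the arithmetic mean as the "value" of a sample, and the function name `summarize_placements`, are assumptions made only for illustration, since the study does not state how the approximate value was computed.

```python
from collections import Counter
from statistics import mean


def summarize_placements(piles):
    """Return (modal pile, mean pile) for one sample's judge placements.

    The mean is an assumed stand-in for the "value" the text describes
    as slightly less or slightly more than three.
    """
    modal_pile, _ = Counter(piles).most_common(1)[0]
    return modal_pile, mean(piles)


# Placements reported in the text: five judges per sample.
sample_31 = [3, 3, 3, 2, 2]  # three judges chose pile 3, two chose pile 2
sample_47 = [3, 3, 3, 3, 4]  # four judges chose pile 3, one chose pile 4

print(summarize_placements(sample_31))  # (3, 2.6)
print(summarize_placements(sample_47))  # (3, 3.2)
```

Both samples have pile three as the modal placement, while the means fall just below and just above three, which agrees with the description in the text.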

The original plan was to develop a general 
scale from the responses to the three stimuli. 
This was abandoned because the different 
stimuli influenced to some extent the re- 
sponses, i.e., the response to an incomplete 
story differed greatly from the response to a 
picture because of the inherent difference in 
the stimulus itself. In this way, also, a fre- 
quent criticism of written scales (that the re- 
sponses are of a single general type and diffi- 
cult to make comparisons with unless the 
same stimulus is used) was overcome. Therefore
twenty-five samples resulting from the
picture as a stimulus, twenty-five samples resulting
from the object as a stimulus, and
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twenty-five samples resulting from the story 
as a stimulus were selected as the basis of the 
three types of scales. 

The samples for each stimulus were scaled 
by means of the method developed by Thurstone.¹
The basis of this technique is the
composite opinion of qualified judges who 
rated each of the samples in relation to every 
other sample. Each of the samples was rated 
by twenty-five judges. A chance-half corre- 
lation was calculated for these judgments to 
obtain some idea of the consistency of judgments.
The correlations which resulted were
.98, .98, and .97 for the judgments on the 
story, picture, and object samples respectively. 
This indicated that there was close agreement 
on the part of the judges as to the relative 
position given each sample. 
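
As a rough illustration of how scale values can be derived from judgments of each sample against every other sample, the following is a minimal sketch of one common simplification of Thurstone's paired-comparison method (often called Case V). The study does not say which form of the method was applied, so the procedure, the function name `thurstone_case_v`, the clamping limits, and the proportions are all assumptions for illustration and are not the study's data.

```python
from statistics import NormalDist, mean


def thurstone_case_v(pref):
    """Scale values from a matrix of paired-comparison proportions.

    pref[i][j] is the proportion of judges who rated sample i as better
    than sample j. Each proportion is converted to a unit-normal deviate
    and the deviates are averaged for each sample.
    """
    inv = NormalDist().inv_cdf
    n = len(pref)
    raw = []
    for i in range(n):
        deviates = []
        for j in range(n):
            if i == j:
                continue
            # Clamp so that unanimous judgments do not give infinite deviates.
            p = min(max(pref[i][j], 0.01), 0.99)
            deviates.append(inv(p))
        raw.append(mean(deviates))
    lowest = min(raw)
    # Shift so that the poorest sample has a scale value of zero.
    return [value - lowest for value in raw]


# Hypothetical proportions for three samples (not taken from the study).
hypothetical_pref = [
    [0.5, 0.2, 0.1],
    [0.8, 0.5, 0.3],
    [0.9, 0.7, 0.5],
]
scale_values = thurstone_case_v(hypothetical_pref)
print(scale_values)
```

With these hypothetical proportions the third sample receives the highest scale value and the first the lowest, as the proportions would suggest.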

The evaluation of the technique was made 
by using the scales to assign values to the 
oral responses of each pupil. Almost exactly 
a year after the first oral language records 
were obtained the same schools were revisited 
and additional recordings of the oral produc- 
tions of the same pupils were made. Each 
pupil was asked to respond to the stimuli to 
which he had responded the year before. 
These responses were recorded in the same 
way and under the same conditions as those 
of the previous year. Two sets of records, a 
year apart, were then available for analysis. 

Three judges were trained to use these 
scales. When a judge was able to use the 
scales with facility and accuracy he was given 
copies of the 1936 samples, arranged accord- 
ing to stimulus, and a copy of each of the 
three scales. He worked independently, as- 
signed a value to a sample, and recorded these 
ratings on a sheet prepared for the purpose. 
After a group was completed the judge was 
allowed to go back and adjust the values as- 
signed if he wished. Several weeks later each 
judge again practiced using the scales and 
was then given the 1937 samples to be rated 
in the same way. 

These ratings were used to evaluate the 
scales. There were 249 samples recorded in 
1936 but in 1937 it was possible to locate 
only 199 of the same children. The others
had transferred from the school systems in
which they had formerly been registered.
The evaluation of the scales is therefore based
on the 199 pairs of responses. The results of


¹ L. L. Thurstone, “An Experimental Study of Nationality
Preferences,” Journal of General Psychology, I (July, 1928),
405-423.
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these ratings are summarized in Table I 
which shows that with one exception the 
differences are statistically significant. 

For the purpose of analyzing individual 
gains and losses in oral composition ability, 
as measured by these scales, distributions 
were made of the number and percentage of 
pupils who made a gain or a loss for each 
stimulus from 1936 to 1937. Table II gives 
a summary of the results. This table shows 
that the gain in oral language ability, as 
measured by the scales, was always greater 
than the loss after the lapse of a year. When 
the story was used as a stimulus and the re- 
sulting responses were rated by means of the 
story scale, 22 percent more pupils made a 
gain than made a loss. Similar analysis shows 
that when the picture was used the excess of 
pupils making gains rather than losses was 
40 percent and when the object was used this 
excess gain was 54 percent. Table II also 
shows that with one exception the percentage 
of pupils making a gain exceeded the per- 
centage of pupils making a loss for each in- 
terval when the gains and losses were tabu- 
lated by intervals of one. That is, more 
pupils made a gain of from zero to one unit 
than made a loss of from zero to one unit; 
more made a gain of from one to two units 
than made a loss of from one to two units, 
and similarly for from two to three units and 
above three units. The only exception that 
was found was when the story was used as a 
stimulus. Two percent more pupils made a 
loss of over three units than made a gain of 
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over three units for this stimulus. It was 
pointed out before that when this scale and 
stimulus were used the amount of improvement
found was not statistically significant.
However, this stimulus did show a greater 
percentage of pupils gaining than losing for 
the total group and in the intervals from one 
to three. 
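
The tabulation behind Table II can be sketched as follows. The function `gain_loss_summary` and the ratings passed to it are hypothetical and serve only to show how the percentages and averages would be obtained from paired ratings; the reported gain and loss percentages at the end are the figures printed for the three stimuli.

```python
def gain_loss_summary(first, second):
    """Summarize gains and losses between two ratings of the same pupils."""
    diffs = [b - a for a, b in zip(first, second)]
    gains = [d for d in diffs if d > 0]
    losses = [-d for d in diffs if d < 0]
    n = len(diffs)
    return {
        "pct_gain": 100 * len(gains) / n,
        "pct_loss": 100 * len(losses) / n,
        "pct_neither": 100 * (n - len(gains) - len(losses)) / n,
        "avg_gain": sum(gains) / len(gains) if gains else 0.0,
        "avg_loss": sum(losses) / len(losses) if losses else 0.0,
    }


# Hypothetical scale ratings for five pupils (not the study's data).
ratings_1936 = [3, 4, 2, 5, 4]
ratings_1937 = [4, 4, 3, 4, 6]
summary = gain_loss_summary(ratings_1936, ratings_1937)
print(summary)

# Reported percentages (gain, loss) for each stimulus.
reported = {"story": (60, 38), "picture": (69, 29), "object": (79, 25)}
excess = {name: gain - loss for name, (gain, loss) in reported.items()}
print(excess)  # {'story': 22, 'picture': 40, 'object': 54}
```

The excess figures of 22, 40, and 54 percent quoted in the text are simply the differences between the reported gain and loss percentages.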

Table II also shows that, with the excep- 
tion of the responses to the story stimulus, 
the average gain of those who gained was 
always greater than the average loss of those 
who lost. The excess of average gains over 
average losses was —.1, .6, .6, and .4 for the 
story, picture, object, and the combination of 
the three respectively. Previous to this time 
it has been taken for granted that most chil- 
dren would make a gain in oral language abil- 
ity during a year’s instruction in this subject. 
However, adequate proof of this fact and a 
definite knowledge of the extent of this gain 
were not available. This experiment furnishes 
proof that this assumption of a significant 
growth in oral language ability is correct. It 
shows that as a result of a year’s school ex- 
perience the majority of the pupils made sig- 
nificant gains in ability to express themselves 
orally. 

As a result of this study it seems that the 
following conclusions are valid: 


1. The experience in this study indicates 
that the Thurstone method of scale con- 
struction is a reasonably effective means 
of scaling children’s oral productions. 


TABLE I
SUMMARY OF THE RESULTS OF THE RATINGS ON THE 1936 AND 1937 SAMPLES

A. M.    S. D.    S. E. A. M.    r    Diff.    Diff. / S. E. diff.

3.35    .13    15
4.11    .96    .14
3.86    .82    .33
4.16    .62    .23
2.58    .84    .20
3.35    .68    -18
3.97    .      .25
5.05    .      .26
.13

September, 1937 | 





TABLE II
PERCENTAGE OF PUPILS MAKING A GAIN OR LOSS FROM 1936 TO 1937

                             Gain per Unit         Loss per Unit

Story
  Gain            60%        0 to 1      32%       0 to -1        20%
  Loss            38%        1.2 to 2    12%       -1.2 to -2      6%
  Average gain    1.4        2.2 to 3    12%       -2.2 to -3      6%
  Average loss    1.5        3.2 up       4%       -3.2 down       6%

Picture
  Gain            69%        0 to 1      25%       0 to -1        18%
  Loss            29%        1.2 to 2    23%       -1.2 to -2      7%
  Neither          1%        2.2 to 3    15%       -2.2 to -3      3%
  Average gain    1.7        3.2 up       6%       -3.2 down       1%
  Average loss    1.1

Object
  Gain            79%        0 to 1      32%       0 to -1        18%
  Loss            25%        1.2 to 2    19%       -1.2 to -2      8%
  Neither          2%        2.2 to 3    14%       -2.2 to -3     22%
  Average gain    1.7        3.2 up      14%       -3.2 down       2%
  Average loss    1.1

Combination of Story, Picture and Object
  Gain          68.5%        0 to 1    28.5%       0 to -1        17%
  Loss            30%        1.2 to 2  18.5%       -1.2 to -2      7%
  Neither          1%        2.2 to 3  13.5%       -2.2 to -3    3.5%
  Average gain    1.6        3.2 up       8%       -3.2 down     2.5%
  Average loss    1.2


2. From the point of view of English, improvement
in oral composition can be
measured effectively by means of an
oral composition scale.

3. Two of the three scales developed in this
study prove to be useful for measuring
improvement in children’s oral productions.
The critical ratios for the picture
and object scales are 4.53 and 4.91
respectively.

4. Even though the critical ratio for the
story scale is only 1.07, the story scale
would possibly be useful for measuring
improvement, if the inherent limitations
of the responses due to the type of
stimulus used are taken into account
during the rating process.

5. Observation of the work of the judges
during the training period indicates that
teachers can be trained to use an oral
composition scale to judge the oral productions
of their pupils with a resultant
increase in the reliability of their recorded
judgments.

6. As would be expected, more pupils increase
in ability to express themselves
orally as a result of a year’s school experience
than decrease in this ability,
and the average increase of children in
ability to express themselves orally is
greater than the average decrease in
ability to do so.
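
The critical ratios cited in conclusions 3 and 4 are ratios of a difference between means to the standard error of that difference. The sketch below uses the usual formula for the difference between two correlated means, which matches the column headings of Table I (S. E. of each mean, r, Diff.); the study does not state its formula, so this choice, the function name `critical_ratio`, and the numbers in the example are assumptions and not the study's data.

```python
from math import sqrt


def critical_ratio(mean_first, mean_second, se_first, se_second, r):
    """Difference between two correlated means divided by its standard error.

    Assumed formula: S.E. diff = sqrt(se1^2 + se2^2 - 2 * r * se1 * se2).
    """
    se_diff = sqrt(se_first**2 + se_second**2 - 2 * r * se_first * se_second)
    return (mean_second - mean_first) / se_diff


# Hypothetical values chosen only to show the arithmetic.
example_ratio = critical_ratio(3.0, 3.5, 0.1, 0.1, 0.5)
print(round(example_ratio, 2))  # 5.0
```

A ratio of three or more was conventionally taken at the time as evidence of a reliable difference, which is why the ratios of 4.53 and 4.91 are treated as significant and 1.07 is not.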















THE RELATIONSHIP BETWEEN SCORES ON THE SCHOLASTIC 
APTITUDE TEST AND COLLEGE MARKS IN CHEMISTRY 


M. RICHARD DICKTER
Philadelphia, Pennsylvania 


THE SCHOLASTIC APTITUDE TEST 

In 1926 the College Entrance Examination 
Board devised an aptitude test, known as the 
Scholastic Aptitude Test. It is a test offered 
to colleges and universities for the purpose of 
testing all candidates for admission to these 
institutions. No institution, however, is obli- 
gated to give the test; acceptance is entirely 
on a voluntary basis. 

From 1930 until 1936, the period covered by
this investigation, the test consisted of two
sections: one essentially linguistic or verbal
in nature and the other essentially mathematical
in nature. The test aims to measure potential
ability to do work on the college level along 
specific lines, rather than to measure ability 
in general. A primary purpose of the Scho- 
lastic Aptitude Test is its use as an aid in 
guidance and teaching in college; for ex- 
ample, selecting students for admission to the 
various fields of study, guiding them in their 
choice of majors, forestalling through super- 
vision the possible failures of doubtful stu- 
dents, diagnosing the mediocre performance 
or failures of promising students, and sectioning
groups for instructional purposes.¹


The Verbal Section 

The Scholastic Aptitude Test was first 
given to applicants for college entrance by 
the College Entrance Examination Board in 
the year 1926. From 1926 to 1929 inclusive, 
this test consisted only of a verbal or non- 
mathematical section, the composition of 
which changed during these years. In 1926 
and 1927 the verbal section consisted of nine 
sub-tests. In 1928 the number of sub-tests 
was reduced to seven, and further reduced to 
six in 1929. From 1930 to 1936, the period 
covered by this study, the verbal section con- 


¹ E. G. Bill and H. Pennypacker, “Two Deans Consider the
Scholastic Aptitude Tests,” Independent Education, III
(October, 1929), 17.
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sisted of three sub-tests, totalling two hun- 
dred items: 

1. Antonyms 

2. Double definitions 

3. Paragraph reading 


The Mathematical Section 

Not long after the inception of the Scho- 
lastic Aptitude Test, it became evident that 
the test was better adapted to selecting stu- 
dents for those courses in which verbal abil- 
ities are required, such as history, English, 
and Latin, than to selecting students for the 
mathematical and scientific courses, where 
mathematical abilities are essential. It was 
felt that the verbal section alone was not 
sufficient, because it measured linguistic abil- 
ity only and not the ability that is thought 
of as the mathematical and scientific.²

The dissatisfaction became sufficiently
acute to move the Commission on Scholastic 
Aptitude Tests of the College Entrance Ex- 
amination Board to add a second section, in 
1930, to the Scholastic Aptitude Test, after 
experimenting with such a section in 1928 
and 1929. This second part was a mathematical
section consisting of one hundred
items, arranged in the order of difficulty, 
dealing with arithmetic, algebra, and geom- 
etry, and designed better to select students 
for the mathematical and scientific courses. 


THE PROBLEM 


The Scholastic Aptitude Test is an instru- 
ment furnished by the College Entrance Ex- 
amination Board and is used by colleges and 
universities as an aid in selecting students for 
admission to the various fields of study, and 
as an aid in guidance. Considerable time and 
effort have been, and are being, spent on it. 
The question arose at the University of
Pennsylvania of the degree of relationship between
the scores made by students on the
mathematical section of the Scholastic Aptitude
Test at the time of their entrance to the
University and their later academic success,
as judged by their marks, in specific courses
in first-year chemistry. This study was undertaken
in order to answer this question.

² Annual Report of the Commission on Scholastic Aptitude
Tests, 1929, p. 38.

At the same time, the relationship between 
the scores made by the same students on the 
verbal section and their marks in the same
courses was also determined, thus making it 
possible, in addition, to compare the relative 
merits of the two sections as far as first-year 
chemistry is concerned. These relationships 
were investigated for the period 1930-36, the
years during which both sections of the 
Scholastic Aptitude Test were given. 

The situation is an individual one in each
university, so that results obtained at one
university are not necessarily directly com- 
parable with similar results obtained at other 
universities. Yet, the findings at the Univer- 
sity of Pennsylvania, in the sense that they 
indicate trends, will be of value to other 
interested groups outside this university. 
























THE COURSES STUDIED





Two one-semester chemistry courses, generally
studied in the freshman year, were included:
Chemistry 1 (General Inorganic
Chemistry) and Chemistry 3 (Qualitative 
Analysis). Chemistry 1 is a prerequisite for 
Chemistry 3. 










THE DATA





All the data for this investigation were obtained
directly from the records of the individual
students. As far as the University of Pennsylvania
is concerned, the data do not represent a
sampling: all students who came within the scope
of the investigation, that is, all students who
had been admitted to the University of Pennsylvania
during the years from September, 1930, to
February, 1936, and who, up to June, 1936, had
taken one or both of the two chemistry
courses studied, were included.


In order to obtain the data, all the undergraduate
schools³ of the University were visited and the
records of their former and present students
examined. When the record of a student was found
who had been admitted during the period between
the academic years 1930-31 and 1935-36 and who
had taken one or both of the two chemistry
courses, the name of the student, his school,
whether he had studied Chemistry 1 or both
Chemistry 1 and Chemistry 3, and his final
mark in each of these courses were noted.
Then the admission slips of these students
were located and their scores on the verbal
and mathematical sections of the Scholastic
Aptitude Test recorded.

³ The College (for men), The College of Liberal Arts for
Women, The School of Fine Arts, The Moore School of Electrical
Engineering, The School of Education, The Towne Scientific
School (including Chemical, Civil, and Mechanical
Engineering), and The Wharton School of Finance and
Commerce.

In this way, a total of 2603 cases, or marks, 
were obtained for investigation, of which 
1742 were in Chemistry 1 and 861 in Chem- 
istry 3. About fifty per cent of the cases in 
Chemistry 1 came from The College, about 
thirty-five per cent from The Towne Scien- 
tific School, with the remainder distributed 
among the other schools; whereas about
sixty-five per cent of the cases in Chemistry 3
came from The College, about twenty-five per
cent from The Towne Scientific School, with
the remainder distributed among the other
schools.






STATISTICAL TREATMENT


The relationship between test scores and 
course marks was determined by calculating 
the coefficients of correlation between the 
variables involved by means of the Pearson 
product-moment method. Coefficients of cor- 
relation were computed between the scores on 
the verbal and mathematical sections of stu- 
dents admitted to the University of Pennsyl- 
vania in each of the six academic years cov- 
ered in this investigation, 1930-31 to 1935- 
36, inclusive, and the marks they later made 
in the courses under consideration. Four such 
correlations were made for each year—two 
between verbal test scores and marks in each 
of the courses and two between the mathe- 
matical test scores and marks in each of the 
same courses (verbal test scores and mathe- 
matical test scores refer, respectively, to the 
scores on the verbal and mathematical sec- 
tions of the Scholastic Aptitude Test). 
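
For readers who wish to retrace the computation, the following is a minimal sketch of the Pearson product-moment coefficient together with the conventional probable error of r, P.E. = 0.6745(1 - r²)/√N. The article does not print the probable-error formula, so its use here is an assumption; it does, however, reproduce the probable errors tabled for Chemistry 1 in 1930-31 (N = 261). The function names and the small data set passed to `pearson_r` are hypothetical.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson product-moment coefficient of correlation."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def probable_error(r, n):
    """Conventional probable error of r (assumed formula, see lead-in)."""
    return 0.6745 * (1 - r**2) / sqrt(n)


# Hypothetical scores and marks, perfectly related, to show the call.
print(pearson_r([1, 2, 3, 4], [2, 4, 6, 8]))

# Chemistry 1, 1930-31: r and N as reported in Tables I and II.
print(round(probable_error(0.302, 261), 3))  # 0.038 (verbal section)
print(round(probable_error(0.526, 261), 3))  # 0.03 (mathematical section)
```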


A similar set of four correlations was calculated
for the entire period 1930-36, regardless
of the year of admission. Separate cor-
relations by years were made in addition to 
those for the entire period because it was felt 
that the latter, while valuable as a summary 
of what had happened, would not disclose any 
trends or variations that might have occurred 
during the years. 
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It will be noted, likewise, that separate cor- 
relations were calculated between test scores 
and marks in each of the two chemistry 
courses, rather than between test scores and
the average marks of the two courses. The 
latter procedure would have yielded a con- 
fused picture of the relationships sought. A 
relationship determined for the average of 
two courses could not very well be applied to 
either of the two courses included in the 
average. 

The number of cases that were available 
for each of the correlations by years and for 
the entire period may all be determined from 
Table I. The number of cases for a course 
in a particular year may be obtained from the 
column for that year, and for the entire 
period 1930-36 from the last column on the 
right. 
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RESULTS AND CONCLUSIONS 

The results obtained in this investigation
are presented in Tables II-VII, inclusive.
Tables II, III, and IV contain the detailed
results by years and Tables V, VI, and VII
the corresponding summary results for the
entire period 1930-36. Tables II and V
show the coefficients of correlation between
marks in each of the courses and scores on
the verbal and mathematical sections,
Tables III and VI the means of the course
marks and the means of the scores on the
verbal and mathematical sections, and
Tables IV and VII the standard deviations
of the course marks and the standard deviations
of the scores on the verbal and mathematical
sections.

Tables II-VII, inclusive, contain certain
abbreviations the meanings of which are 
necessary in order to understand the tables. 


TABLE I

DISTRIBUTION OF CASES BY COURSES IN EACH ACADEMIC YEAR FOR THE
ENTIRE PERIOD 1930-36

Course         1930-31  1931-32  1932-33  1933-34  1934-35  1935-36   Total

Chemistry 1      261      299      302      324      293      263      1742
Chemistry 3      127      152      171      183      168       60       861

Total            388      451      473      507      461      323      2603

TABLE II

CORRELATIONS BETWEEN MARKS IN THE CHEMISTRY COURSES AND SCORES ON THE VERBAL AND
MATHEMATICAL SECTIONS OF THE SCHOLASTIC APTITUDE TEST FOR EACH
ACADEMIC YEAR IN THE PERIOD 1930-36

               1930-31                    1931-32                    1932-33
Course     V. S.        M. S.         V. S.        M. S.         V. S.        M. S.
          r    P.E.    r    P.E.     r    P.E.    r    P.E.     r    P.E.    r    P.E.
C. 1    .302   .038  .526   .030   .286   .037  .422   .032   .221   .037  .381   .033
C. 3    .224   .056  .415   .049   .193   .053  .309   .050   .181   .050  .283   .047

               1933-34                    1934-35                    1935-36
Course     V. S.        M. S.         V. S.        M. S.         V. S.        M. S.
          r    P.E.    r    P.E.     r    P.E.    r    P.E.     r    P.E.    r    P.E.
C. 1    .362   .032  .458   .030   .314   .036  .443   .031   .107   .041  .366   .036
C. 3    .256   .047  .329   .044   .158   .051  .267   .048  -.042   .087  .389   .074

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APTITUDE AND MARKS 


The meanings of these abbreviations are
given here:

C = chemistry
V.S. = verbal section
M.S. = mathematical section
r = coefficient of correlation
P.E. = probable error of the coefficient of
correlation
X’ = mean of the course marks
Y’ = mean of the scores on the verbal section
or the mathematical section of
the Scholastic Aptitude Test
σx = standard deviation of the course
marks
σy = standard deviation of the scores on
the verbal section or the mathematical
section of the Scholastic Aptitude
Test


TABLE III



MEANS OF THE COURSE MARKS AND THE MEANS OF THE SCORES ON THE VERBAL AND
MATHEMATICAL SECTIONS OF THE SCHOLASTIC APTITUDE TEST FOR EACH
ACADEMIC YEAR IN THE PERIOD 1930-36

               1930-31                1931-32                1932-33
Course     X’    V. S.   M. S.     X’    V. S.   M. S.     X’    V. S.   M. S.
                  Y’      Y’              Y’      Y’              Y’      Y’
C. 1      4.2    45.0    43.5     4.3    46.5    45.0     5.0    43.0    39.5
C. 3      4.8    49.0    46.5     4.7    50.5    45.5     4.9    45.5    41.0

               1933-34                1934-35                1935-36
Course     X’    V. S.   M. S.     X’    V. S.   M. S.     X’    V. S.   M. S.
                  Y’      Y’              Y’      Y’              Y’      Y’
C. 1      4.8    41.5    40.0     4.6    42.5    41.5     4.6    42.5    43.0
C. 3      4.9    41.5    41.0     4.9    46.5    44.5     4.5    45.0    51.5

TABLE IV

STANDARD DEVIATIONS OF THE COURSE MARKS AND THE STANDARD DEVIATIONS OF THE SCORES
ON THE VERBAL AND MATHEMATICAL SECTIONS OF THE SCHOLASTIC APTITUDE TEST FOR
EACH ACADEMIC YEAR IN THE PERIOD 1930-36

               1930-31                 1931-32                 1932-33
Course     σx     V. S.   M. S.     σx     V. S.   M. S.     σx     V. S.   M. S.
                   σy      σy               σy      σy               σy      σy
C. 1      2.74   29.30   30.00     2.67   27.00   28.55     2.44   27.80   29.85
C. 3      2.82   29.20   30.70     2.72   26.05   28.20     2.76   27.80   29.55

               1933-34                 1934-35                 1935-36
Course     σx     V. S.   M. S.     σx     V. S.   M. S.     σx     V. S.   M. S.
                   σy      σy               σy      σy               σy      σy
C. 1      2.64   28.40   29.20     2.67   27.45   27.75     2.61   28.05   27.30
C. 3      2.68   28.75   28.90     2.54   27.30   28.90     2.39   27.55   29.15

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TABLE V 


SUMMARY OF CORRELATIONS BETWEEN MARKS
IN THE CHEMISTRY COURSES AND SCORES ON
THE VERBAL AND MATHEMATICAL SECTIONS
OF THE SCHOLASTIC APTITUDE TEST FOR THE
ENTIRE PERIOD 1930-36

            Verbal section     Mathematical section
Course       r       P.E.        r       P.E.
C. 1       .253      .015      .423      .013
C. 3       .189      .022      .315      .021

TABLE VI

SUMMARY OF THE MEANS OF THE COURSE
MARKS AND THE MEANS OF THE SCORES ON
THE VERBAL AND MATHEMATICAL SECTIONS
OF THE SCHOLASTIC APTITUDE TEST FOR THE
ENTIRE PERIOD 1930-36

Course     X’     V. S.    M. S.
                   Y’       Y’
C. 1      4.6     43.5     42.0
C. 3      4.8     46.0     44.0

TABLE VII

SUMMARY OF THE STANDARD DEVIATIONS OF
THE COURSE MARKS AND THE STANDARD
DEVIATIONS OF THE SCORES ON THE VERBAL
AND MATHEMATICAL SECTIONS OF THE SCHOLASTIC
APTITUDE TEST FOR THE ENTIRE
PERIOD 1930-36

Course     σx      V. S.    M. S.
                    σy       σy
C. 1      2.65    28.05    28.85
C. 3      2.70    28.00    29.35








Analysis by Years—See Table II 

1. The correlations on the mathematical 
section are higher than those on the verbal 
section in both Chemistry 1 and Chemistry 3 
each year. 

2. The correlations are lower for Chem- 
istry 3 than for Chemistry 1 in every case on 
both the verbal and the mathematical sec- 
tions, with the exception of the mathematical 
section for the 1935-36 group. 
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Analysis over the Entire Period 1930—36~ 
See Table V 


1. The correlations on the mathematical 
section are higher than those on the verbal 
section for both courses, when considering 
results over the entire six-year period. 

To determine the reliability of the differ- 
ence between the obtained coefficients of cor- 
relation for the verbal and mathematical sec- 
tions, the ratio of the difference of the two 
coefficients to the probable error of their 
difference⁴ was calculated for each course.
In both Chemistry 1 and Chemistry 3, the 
ratios so found were greater than four, indi- 
cating that there is complete certainty (100 
chances in 100) that the obtained difference 
represents a true difference greater than zero.⁵

2. The correlations on both the mathe- 
matical and verbal sections are lower for 
Chemistry 3 than for Chemistry 1 in every 
year (with the exception of the mathematical 
section for the 1935-36 group) and for the 
entire six-year period. The lower correlation 
in the second semester of the course is prob- 
ably due to the greater homogeneity of the 
students in the second half. If the means of 
the scores of the students on the mathemat- 
ical section in Table VI are compared for 
Chemistry 1 and Chemistry 3, and similarly 
for the verbal section, it is seen that these 
means are higher for Chemistry 3 than for 
Chemistry 1, thus showing that in the second 
half of the course the students represent a 
group higher in those abilities tested by the 
Scholastic Aptitude Test, due to the elimina- 
tion of the students who failed at the end of 
the first half. 

3. The correlations on the verbal section 
are low for both Chemistry 1 and Chemistry 
3, but those on the mathematical section are 
much better. It must be remembered that 
we are dealing with a single instrument and 
that we are on the college level, where the 
student personnel is more homogeneous than 
on the secondary level. The obtained co- 
efficients of correlation are also lower than 
they otherwise would be, because not all the 
candidates who apply for admission and take 
the Scholastic Aptitude Test are admitted, 
the poorer ones being rejected. The more 

⁴ $D / \mathrm{P.E.}_{(\mathrm{diff.}\ r_1 - r_2)}$

⁵ Henry E. Garrett, Statistics in Psychology and Education,
ew Table XV. New York: Longmans, Green and Co.

⁶ Twenty-Sixth Annual Report of the College Entrance
Examination Board, 1926, p. 8.
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homogeneous the group, the greater is the re- 
striction in the range of ability, and this re- 
striction tends to lower the coefficients of 
correlation.⁷

4. Beginning with the June, 1936, Scho- 
lastic Aptitude Test, taken in the main by 
candidates for admission in September, 1936, 
the College Entrance Examination Board 
dropped the mathematical section. This 
move by the Board did not affect any of the 
students considered in this study, since the 
study concerns itself with students admitted 
between September, 1930, and February, 
1936, during which time the verbal and 
mathematical sections did not change and 
were taken by all applicants.

In place of the mathematical section, the
Board is offering a new series of mathemat- 
ical examinations, called Alpha, Beta, and 

⁷ Ruth K. Byrns, “Scholastic Aptitude and Freshman
Achievement,” School and Society, XXXV (May 21, 1932).
A. B. Crawford, “Forecasting Freshman Achievement,”
School and Society, XXXI (January 25, 1930), 129.
Frank S. Freeman, “Predicting Academic Survival,” Journal
of Educational Research, XXIII (February, 1931), 116.
David Segel, Prediction of Success in College, p. 19. U.S.
Office of Education Bulletin, 1934, No. 15. Washington:
Office of Education.
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Gamma.* Since the Board cannot compel the 
acceptance of its offerings, it remains for the 
individual university to decide upon their 
acceptance or rejection. 
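
The reliability ratio described under item 1 above can be retraced from Table V. The sketch below takes the probable error of the difference as the square root of the sum of the squared probable errors. This is an assumption, since the article gives only the form of the ratio, and it ignores the fact that both coefficients come from the same students. The function name `difference_ratio` is hypothetical.

```python
from math import sqrt


def difference_ratio(r_math, pe_math, r_verbal, pe_verbal):
    """Ratio of the difference between two r's to its probable error.

    Assumes P.E. diff = sqrt(pe_math^2 + pe_verbal^2), that is, the two
    coefficients are treated as independent.
    """
    pe_diff = sqrt(pe_math**2 + pe_verbal**2)
    return (r_math - r_verbal) / pe_diff


# Coefficients and probable errors for the entire period (Table V).
chem_1 = difference_ratio(0.423, 0.013, 0.253, 0.015)
chem_3 = difference_ratio(0.315, 0.021, 0.189, 0.022)
print(chem_1 > 4, chem_3 > 4)  # True True
```

Under this assumption both ratios exceed four, in agreement with the statement made in the text for Chemistry 1 and Chemistry 3.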

The results obtained on the mathematical 
section in this investigation show sufficient 
promise to indicate the desirability of the 
continued use of such a section in the Scho- 
lastic Aptitude Test for the guidance and 
teaching of students in courses in chemistry. 
Since the original mathematical section is no 
longer available, and since the College En- 
trance Examination Board is now offering 
the new mathematical examinations (Alpha, 
Beta, and Gamma) in its place, it is sug- 
gested that these new examinations be given 
a trial. It would remain, however, for a 
future study similar to this one, by compar- 
ing the results of such a study with those 
found here, to determine whether the new 
mathematical examinations are more closely 
related to marks in chemistry than was the 
original mathematical section. 


*“Report of the Commission on Examinations in Mathematics,”
Mathematics Teacher, XXVIII (March, 1935),
154-66.
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A STUDY OF AURAL LEARNING WITH AND WITHOUT 
THE SPEAKER PRESENT¹
J. EDWIN LOPER


Principal of Havelock High School 
Lincoln, Nebraska 


INTRODUCTION 


The quantity and quality of aural learning 
present problems wherever there is a radio 
and learning is the primary objective. Sound 
equipment is common in schools and yet few 
of the problems concerning its efficient use 
have been solved. As a matter of fact, with 
the speaker in the room no universally ac- 
cepted best method of obtaining the greatest 
quantity and quality of aural learning has 
ever been established. Aural learning prob- 
lems, then, have been multiplied by the in- 
troduction of loud speaker presentations. 

The researches in the field represented by 
these problems have helped in only a small 
way to untangle and solve them. After study- 
ing the work which has been done with the 
problems that could be solved in the field, as 
presented by W. W. Charters in his pamphlet,
Research Problems in Radio Education,² one
is overwhelmed by the lack of
information. More facts are needed concern- 
ing the learning of information over the loud 
speaker in order to establish scientific foun- 
dations for all its predictions.³

A great many of these facts fall directly 
in the field of psychology. What facts are 
needed, the gathering of these facts, and 
their verification are a challenge to those in- 
terested in psychological research. This prob- 
lem was suggested to the writer by a state- 
ment made by Professor D. A. Worcester⁴ in 
the closing of a paper read to the 1932 
National Institute For Education By Radio. 
In connection with his report on certain in- 
vestigations he made this statement, “I do 
not know whether or not the scores on the 
various materials reported in the paper would 


¹ The complete study is on file in the office of Dr. D. A. 
Worcester, Teachers College, University of Nebraska. 

² W. W. Charters, Research Problems in Radio Education. 
Information Series No. 4. New York: National Advisory 
Council on Radio in Education, 1932. 

³ L. Tyson and W. J. Donovan, Retrospect and Forecast in 
Radio Education. Chicago: University of Chicago Press. 
Pp. 254. 

⁴ D. A. Worcester, “What People Remember from Radio 
Programs,” Education on the Air, 1932, p. 228. Columbus, 
Ohio: Ohio State University Press. 
have been better had the participants been 
listening to a speaker in the room rather than 
over the radio.” 


THE PROBLEM 


This study is concerned with the learning 
and retention of factual material presented 
over the loud speaker compared with the 
learning and retention of factual material 
presented directly by the speaker himself. 
Does it make a difference in the amount 
learned and retained whether or not the 
speaker is in the learner’s presence? 


THE PROCEDURE 
The Plan 

Four lessons were taught by two types of 
presentation and a comparison made of the 
learning. Four hundred and forty-nine 8th 
grade pupils in seven junior high schools in 
the Public Schools of Lincoln, Nebraska, 
were the subjects in this experiment. Their 
scores on experimental tests were studied in 
the light of common items of information 
concerning them on their school census 
sheets, and other information thought to be 
pertinent. 

After a pre-test was given, four lessons 
were presented to the pupils, upon which they 
were tested immediately after each lesson and 
at two later dates. In presenting each lesson 
in each experimental set-up there were two 
groups of pupils. One group heard the lesson 
directly from the speaker with the speaker 
in their presence, while the other group heard 
the presentation simultaneously in another 
room over a loud speaker. 


Tests 

All tests were of the true-false type. The 
first or immediate retention tests, given im- 
mediately after each lesson, were constructed 
first, and the questions were based directly on 
the factual material in each lesson. The pre- 
test and the second and third retention tests 
were the same except for the directions and 
certain questions which gathered information 
concerning a radio in the pupil’s home and 
concerning the reading of related informa- 
tion. Twenty-five questions were taken 
from each of the four immediate retention 
tests to make up the 100 questions in each 
of these tests. 

There were 43 questions in the immediate 
retention test on the first lesson, and 54 ques- 
tions on the immediate retention test after 
the second lesson. There were 50 questions 
in each of the first or immediate retention 
tests following the third and fourth lessons. 

In order to eliminate errors all tests and 
their directions were carefully checked by 
several persons in the field of tests and 
measurements. 


Time of Experiment 

The pre-test and the last two retention 
tests required not more than 30 minutes to 
answer. Each lesson was approximately 15 
minutes in length and its first or immediate 
retention test took about 15 minutes of the 
pupils’ time. Therefore, no period of testing, 
or teaching and testing, was over 30 minutes 
in length. 

The experiment took place during the 
spring semester of the 1936-37 school year. 
The administrative staff and faculty of the 
Lincoln Schools were very cooperative 
throughout the experiment. 

Subject Matter 

The subject matter chosen for the lessons 
was selected from a course of study on nar- 
cotics, constructed by the writer with the 
assistance of a committee of junior high 
school teachers. The committee exercised 
care in placing the materials on the proper 
grade levels. Therefore, since the lessons for 
this experiment were taken directly from the 
course, the lessons were on the learners’ level. 


Lessons 


Because the lecture or straight-talk type of 
presentation is very common in the class 
room and over the radio it was chosen as the 
type of presentation for this experiment. The 
purpose of the first lesson was to establish 
some basic concepts concerning narcotics, 
their advertising, and the qualities to expect 
in an authority on narcotics. The purpose 
of the second and third lessons was to present 
certain factual material concerning alcohol 
and tobacco. The purpose of the fourth les- 
son was to present facts concerning narcotics 
other than alcohol and tobacco. Suggestions 
by Lumley⁵ and Denison⁶ concerning the con- 
struction and giving of talks to be heard over 
the loud speaker were followed as they ap- 
peared to apply to this experimental situa- 
tion. In no way, however, were the lessons 
made to be super-impressive, as many talks 
are, which one is likely to hear over the radio. 
The lessons more nearly resembled talks one 
is likely to hear in the ordinary class room. 


Time Between Tests 

The pupils knew they were to be tested on 
the lessons and whenever they asked ques- 
tions about their scores they were encouraged 
and were promised their scores after the last 
test. 

The pre-test was given from 1 to 3 days 
before the first lesson. The time between 
lessons one and two was 1 to 4 days; between 
lessons two and three, 1 to 3 days; and be- 
tween lessons three and four, 1 to 5 days. 
The first retention tests were given immedi- 
ately following each lesson. The second re- 
tention test was given 21 to 28 days after 
the last lesson. The third and last retention 
test was given 16 to 21 days after the second 
retention test. The last retention test was 
given an average of 44 days after the last 
lesson. 


Presentation of Lessons 


The lessons were presented in the same 
way in the ten experimental set-ups. The 
speaker stood so that he would be about 
eight inches from the microphone and could 
talk in a natural tone. No effort was made 
to be impressive or oratorical. However, an 
effort was made to accentuate the thought or 
idea in each sentence. Care was used to 
pronounce words clearly and distinctly. 


Equipment 

The amplifying unit, loud speaker, micro- 
phone, and cable were all used as an ampli- 
fying system a year prior to their use in the 
experiment. The microphone was a high 
grade carbon microphone which worked very 
satisfactorily with the other equipment. Care 
was exercised in the pronunciation of words 
which carried an “s’’ or an “sh” sound, since 
the carbon microphone has a tendency to 


⁵ F. H. Lumley, “Radio’s Rate of Speech,” Educational 
Research Bulletin, XII (November, 1933), 244-46. 

⁶ Merrill Denison, The Educational Program. New York: 
The Radio Institute of the Audible Arts, 1936. Pp. 11. 
make such sounds annoying from the loud 
speaker. The experimenter had experience 
talking over this particular microphone sev- 
eral times a month for a year prior to this 
experiment. The reports from the children 
and the teachers indicated that the loud 
speaker productions were clear and audible. 
The microphone cable was shielded in an 
effort to eliminate interference. The loud 
speaker was an eight inch dynamic speaker, 
common at the present time in most of the 
ordinary home radio sets. The amplifying 
unit was capable of developing twelve watts, 
which, of course, is a great deal more than is 
needed to run one loud speaker. 

During the entire experiment the tone con- 
trol and power rheostat were set so that the 
quality and volume of the voice heard over 
the loud speaker were the same for all groups. 
There was a slight transformer hum audible 
in any part of the reception rooms. While 
this was undesirable as far as making the 
loud speaker presentation perfect is con- 
cerned, it was the same for all groups. Pupils 
reported that the hum was not objectionable 
and did not interfere with their understand- 
ing. Since the schools in Lincoln did not 
have public address systems it was necessary 
to have portable equipment for this experi- 
ment. It was easily set up and very durable. 


Experimental Technique 

Many difficulties were encountered in at- 
tempting to get experimental groups in the 
various schools which would continue as 
whole groups throughout the time necessary 
for the experiment. Chief of these difficulties 
was absences. Actually 639 pupils partici- 
pated in the experiment, but the results from 
only 449 were used, principally because of 
absence which made the records for some 
incomplete. 

Of these 449 pupils 207 were in Group I 
and 242 were in Group II. The group rota- 
tion technique was selected as best for the 
experiment. The types of presentation were 
rotated between the groups so that all the 
pupils heard half the material by each 
method, alternation taking place throughout. 

The letter “A” refers to the presentation 
given directly from the speaker and the letter 
“B” refers to the presentation over the loud 
speaker. The results of each, A and B, are 
based on two test responses per pupil. There- 
fore, where A and B groups are compared, 
there are twice as many pupil responses as 
there are actually pupils taking part. For 
example, 898 subject responses in Group A 
and 898 subject responses in Group B are 
compared in Table II. It is well to keep this 
fact in mind in reading the tables, since the 
numbers in the tables, except Table I, indi- 
cate pupil responses rather than pupils. 


Natural School Conditions 

The conditions under which the experiment 
took place were natural for the pupils. Reg- 
ular class groups within the schools, with 
regular teachers in the rooms, made up the 
groups for the experiment. These teachers 
reported that the learning situations were 
very good. 

One person, the writer, gave all the lessons 
and administered all the tests, thus eliminat- 
ing the errors that might be introduced by a 
number of experimenters taking part. Every 
effort was made to have all experimental set- 
ups the same. 

The Subjects 

The subjects for the experiment were 
normal eighth grade pupils from the seven 
junior high schools located in the various 
parts of the city. In one school there were 
three experimental set-ups, and in another 
school there were two experimental set-ups. 
However, they are listed as ten different 
schools rather than set-ups in order not to 
disclose the identity of the schools. 


Collecting and Computing Data 

The tests were corrected by trained assist- 
ants and checked for accuracy. The scoring 
was done by subtracting the wrongs from the 
rights and then transposing the answer into 
a percentage score. All data were collected 
and recorded on cards. After the data were 
recorded the information on the card was 
coded and punched in Hollerith machine 
cards. A great deal of the computing was 
done with the use of the Hollerith machine 
through the International Business Machines 
Corporation. 
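
The scoring rule described above can be sketched in a few lines. This is an illustration only: the text says that wrongs were subtracted from rights and the result expressed as a percentage, but it does not state the base of the percentage, so the sketch assumes the base is the number of questions on the test. The function name `corrected_percentage` and the sample pupil are hypothetical.

```python
def corrected_percentage(rights: int, wrongs: int, n_items: int) -> float:
    """Rights-minus-wrongs score as a percentage of the items on the test.

    Assumption: the percentage base is the number of items; the paper
    does not say so explicitly.
    """
    if n_items <= 0:
        raise ValueError("n_items must be positive")
    if rights < 0 or wrongs < 0 or rights + wrongs > n_items:
        raise ValueError("rights and wrongs must be non-negative and fit within n_items")
    return 100.0 * (rights - wrongs) / n_items


# Hypothetical pupil on a 50-question true-false test:
# 40 right, 8 wrong, 2 omitted.
example_score = corrected_percentage(40, 8, 50)
```

Under this rule a pupil who guesses on every true-false item and gets half of them right receives a score of zero, which is the usual reason for subtracting wrongs from rights.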


DATA PERTAINING TO THE LEARNING BY THE 
TWO TYPES OF PRESENTATION 

It is the purpose in this section to set forth 
the findings relative to the learning of the 
pupils by both types of presentation. Besides 
showing the Group I and II differences and 
the way they were combined by the rotation 
technique to show the Group A and Group B 
differences, there are twelve minor studies 
reported relating to the principal problem. 


Groups I and II 

The means shown in Table I were found 
by combining the Group I scores from the 
ten school groups and the Group II scores 
from the ten school groups. This table makes 
it possible to study the mean differences be- 
tween the combined Group I scores and the 
combined Group II scores. The table is 
divided into four parts. Each part contains 
the means on those parts of the tests refer- 
ring to each of the four lessons. The names 
of the tests are listed in the first column and 
the subscripts indicate to which lesson they 
refer. The actual number of pupils is entered 
in the third column. Mean differences are 
found in every case by subtracting the Group 
I mean from the Group II mean. It is well 
to keep in mind in reading this table that 
the group differences have not been equalized. 
The results, therefore, found in this table are 
from the groups as they were found in the 
schools. 


The first ratio of the mean difference, 
divided by the standard deviation of the dif- 
ference, in the column to the extreme right 
of the table shows that the difference be- 
tween Group I and Group II means on the 
first pre-test is not statistically significant. 
On the first or immediate retention test 
Group I is superior, and their superiority is 
statistically significant, as shown by the cor- 
responding ratio in the last column. Group I 
heard this lesson directly from the speaker 
and Group II heard the lesson over the loud 
speaker. On both of the last retention tests 
on this lesson the ratios indicate that the 
group differences are not statistically sig- 
nificant. 


TABLE I

A COMPARISON BY LESSONS OF GROUP I AND GROUP II TEST SCORE MEANS
BY A AND B PRESENTATIONS

                                          Standard
Test                 Group  No.   Mean    Deviation  σm     Diff.    σDiff.  Diff./σDiff.

Pre-test₁            *I     207   47.94   18.7       1.34
                     II**   242   47.98   13.6        .90   -.04     1.6     -.02
Retention Test I₁    I      207   61.66   20.0       1.41
                     II     242   55.54   18.0       1.2    6.12     1.85    3.31
Retention Test II₁   I      207   63.82   17.4       1.24
                     II     242   64.02   17.7       1.18   -.20     1.7     -.12
Retention Test III₁  I      207   63.33   16.3       1.16
                     II     242   61.26   22.6       1.5    2.07     1.9     1.09

Pre-test₂            I**    207   33.84   17.3       1.23
                     *II    242   35.38   17.4       1.16   -1.54    1.68    -.92
Retention Test I₂    I      207   52.15   21.1       1.5
                     II     242   63.66   19.4       1.29   -11.51   1.96    -5.87
Retention Test II₂   I      207   41.47   16.9       1.2
                     II     242   49.85   20.6       1.36   -8.38    1.8     -4.66
Retention Test III₂  I      207   42.21   19.1       1.36
                     II     242   50.38   20.4       1.36   -7.17    2.0     -3.59

Pre-test₃            I**    207   33.45   16.8       1.3
                     *II    242   31.85   16.2       1.07   1.60     1.6     1.0
Retention Test I₃    I      207   58.72   27.0       1.9
                     II     242   67.07   26.0       1.7    -8.35    2.55    -3.27
Retention Test II₃   I      207   43.79   17.4       1.28
                     II     242   46.79   12.8        .85   -3.00    1.5     -2.0
Retention Test III₃  I      207   43.68   18.7       1.33
                     II     242   46.67   17.9       1.19   -2.99    1.78    -1.68

Pre-test₄            *I     207   37.18   19.3       1.38
                     II**   242   39.33   28.7       1.91   -2.25    2.35    -.96
Retention Test I₄    I      207   60.83   20.6       1.41
                     II     242   56.52   17.0       1.13   4.31     1.8     2.39
Retention Test II₄   I      207   62.55   20.1       1.42
                     II     242   61.81   20.9       1.39    .74     1.98     .87
Retention Test III₄  I      207   66.69   18.8       1.34
                     II     242   66.03   19.5       1.3     .66     1.85     .36

In the second part of the table on the pre- 
test, the mean difference is statistically in- 
significant, but for the three retention tests 
Group II, which heard lesson two directly, 
has significantly higher scores. 

The superiority of this method is again 
demonstrated in these unmatched group 
scores by the retention test scores following 
the last two lessons shown in the last two 
parts of the table. 

Since no effort was made to allow for the 
differences in Group I and Group II in re- 
cording and computing these scores, the supe- 
riority of the direct type of presentation may 
not be real, due to the differences in the 
groups. Likewise, retention test scores are 
colored by differences on the pre-test scores. 
In recording the data in this way no allow- 
ances have been made for any test or lesson 
difficulties which may have existed. 

In order to get a more accurate, simpler, 
and a more nearly composite picture of the 
results from the two types of presentation, 
Table II is constructed to show the mean of 
the combined test scores covering the mate- 
rial given by the direct method, compared 
with the means of the test scores covering the 
material given by the loud speaker presenta- 
tion combined. These scores are combined 
following the rotation technique described in 
the section on “Procedure.” The combina- 
tion of the scores is indicated in Table I by 
the asterisks in the column headed, “Group.” 
The single asterisk is placed by the four divi- 
sions of the pre-test combined to show in 
Table II the mean A score on that part of 
the pre-test covering the material heard later 
directly from the speaker by both groups. 
The double asterisk is placed by those pre- 
test group scores combined to show in Table 
II the mean B score on that part of the four 
divisions of the pre-test covering the material 
heard later over the loud speaker by both 
groups. The same combination is applied to 
the retention tests to show the combined 
means in Table II. 
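
The combination just described can be checked against the pre-test figures. The sketch below is an illustration only. It assumes that the combined means are averages weighted by the number of pupils in each group, which the text does not state, and that Group I heard lessons one and four directly while Group II heard lessons two and three directly, as the asterisks in Table I suggest. The names `pooled_mean` and `combine` are hypothetical.

```python
# (number of pupils, pre-test mean) for each lesson, as read from Table I.
GROUP_I = {1: (207, 47.94), 2: (207, 33.84), 3: (207, 33.45), 4: (207, 37.18)}
GROUP_II = {1: (242, 47.98), 2: (242, 35.38), 3: (242, 31.85), 4: (242, 39.33)}

# Assumed rotation: lessons each group heard directly from the speaker (A).
# Every other lesson was heard over the loud speaker (B).
DIRECT_LESSONS = {"I": {1, 4}, "II": {2, 3}}


def pooled_mean(cells):
    """Mean of several sub-group means, weighted by sub-group size."""
    total_n = sum(n for n, _ in cells)
    return sum(n * mean for n, mean in cells) / total_n


def combine(presentation):
    """Pool the Table I cells belonging to presentation 'A' or 'B'."""
    cells = []
    for name, table in (("I", GROUP_I), ("II", GROUP_II)):
        for lesson, cell in table.items():
            heard_directly = lesson in DIRECT_LESSONS[name]
            if heard_directly == (presentation == "A"):
                cells.append(cell)
    return pooled_mean(cells)


mean_a = combine("A")  # pre-test material later heard directly
mean_b = combine("B")  # pre-test material later heard over the loud speaker
responses_per_presentation = 2 * (207 + 242)
```

Under these assumptions the pooled values round to the pre-test means reported for presentations A and B, and each presentation rests on 898 pupil responses, two from each of the 449 pupils.
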
Groups A and B 

In Table II the differences of the mean test 
scores on the types of presentation, A and B, 
are shown. On the pre-test, covering presen- 
tation A material, the group had a mean score 
of 37.74, with a standard deviation of 17.88. 
The test covering the B type of presentation 
material had a mean score on the pre-test of 
39.04, with a standard deviation of 18.95. 
The mean differences, where A and B aver- 
ages are compared, are found by subtracting 
the mean of B from the mean of A. The 
mean difference between the two groups on 
the pre-test is -1.3, which indicates that the 
difference is in favor of the material given by 
B, or the loud speaker presentation. Since the 
mean difference divided by the standard deviation 
of the difference is -.72, as shown in the last 
column in the table, there is no significant 
statistical difference between the groups on 
this test prior to their hearing the lessons. 
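
The ratio used throughout the tables can be illustrated with the pre-test figures. The sketch below is not the author's stated procedure. It assumes the standard deviation of the difference is the square root of the sum of the squared standard errors of the two means, the usual formula for uncorrelated means. The paper does not give its formula, and the same pupils contribute to both A and B, so this should be read as an assumption. The function names are hypothetical.

```python
import math


def standard_error_of_difference(se_a: float, se_b: float) -> float:
    """Standard error of a difference between two uncorrelated means (assumed formula)."""
    return math.hypot(se_a, se_b)


def critical_ratio(mean_a: float, mean_b: float, se_a: float, se_b: float) -> float:
    """Difference between the means (A minus B) divided by its standard error."""
    return (mean_a - mean_b) / standard_error_of_difference(se_a, se_b)


# Pre-test: mean and standard error of the mean for A and B.
pre_se = standard_error_of_difference(1.24, 1.31)
pre_ratio = critical_ratio(37.74, 39.04, 1.24, 1.31)

# First retention test: mean and standard error of the mean for A and B.
first_se = standard_error_of_difference(1.45, 1.43)
first_ratio = critical_ratio(63.47, 55.76, 1.45, 1.43)
```

With these inputs the assumed formula gives a standard error of about 1.80 and a ratio of about -.72 on the pre-test, and a standard error of about 2.04 on the first retention test, in agreement with the reported values to within rounding.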

On the immediate, or first, retention tests, 
given immediately following each lesson, the 
group averaged higher by the A presentation 
than by the B presentation. The ratio re- 
sulting from the difference divided by the 
standard deviation of the difference is 3.78, 
which shows the difference between the aver- 
ages to be decidedly significant. It appears 
that the experimental factor, namely, the 
presence of the speaker, is responsible for 
this impressive difference. On the last two 
retention tests by the A presentation the 
group averaged higher than by the B, or loud 
speaker presentation, but the ratios indicate 
that their difference is not significant. 

It is interesting to note that the marked 
difference at the end of the first retention test 


TABLE II 

A COMPARISON OF MEANS ON TESTS COVERING LESSONS GIVEN BY THE A, OR DIRECT TYPE OF 
PRESENTATION, AND THE B, OR LOUD SPEAKER TYPE OF PRESENTATION 

                                    Standard
Test                 Group  Mean    Deviation  σm     Diff.   σDiff.  Diff./σDiff.
Pre-test             A      37.74   17.88      1.24
                     B      39.04   18.95      1.31   -1.30   1.80    -.72
Retention Test I     A      63.47   21.51      1.45
                     B      55.76   20.79      1.43   7.71    2.04    3.78
Retention Test II    A      55.18   17.73      1.22
                     B      53.57   18.23      1.25   1.61    1.75    [illegible]
Retention Test III   A      56.12   18.37      1.26
                     B      54.49   19.99      1.37   1.63    1.80    [illegible]

disappears to a large extent by the time of 
the last retention test. The disappearance of 
the difference between the two means might 
be due to a net gain of the low mean or a net 
loss of the high mean. In this case it is due 
to a net loss of the high mean, namely, the 
one for the group by the A presentation. On 
the first retention test the mean indicates that 
the pupils by the A presentation knew on an 
average 63.47 per cent of the material pre- 
sented, but by the time of the last retention 
test they had lost 11 per cent of what they 
knew. On the other hand, the group by the 
B presentation, the ones hearing the material 
over the loud speaker, lost only 2 per cent of 
what they apparently knew at the time of the 
first retention test. Therefore, while the 
group hearing the material directly from the 
speaker in his presence learned more than the 
group hearing the material over the loud 
speaker they likewise forgot more; in fact, so 
much that by the end of the third retention 
test, which was given 44 days after the first 
retention test, the difference between the 
learning of the two groups, as indicated by 
the means on this test, is not statistically 
significant. 
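
The forgetting percentages quoted above can be illustrated from the Table II means. The sketch is an illustration only: it assumes that "lost" means the drop in the mean score between the first and last retention tests, expressed as a percentage of the first retention mean. The paper does not spell out the calculation, and the function name `percent_forgotten` is hypothetical.

```python
def percent_forgotten(first_mean: float, last_mean: float) -> float:
    """Drop between two mean scores as a percentage of the earlier mean (assumed definition)."""
    return 100.0 * (first_mean - last_mean) / first_mean


# Means on the first and third retention tests, from Table II.
lost_a = percent_forgotten(63.47, 56.12)  # direct presentation
lost_b = percent_forgotten(55.76, 54.49)  # loud speaker presentation
```

Under this definition the loss is between 11 and 12 per cent for the A presentation and between 2 and 3 per cent for the B presentation, which agrees with the quoted 11 and 2 per cent if the figures in the text were shortened to whole numbers.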

Assuming that the mean difference in 
favor of the material covered by the B pre- 
sentation on the pre-test is an advantage 
throughout the retention tests, it would in- 
crease the difference divided by the standard 
deviation of the difference ratio on these 
scores. This would make the superiority of 
the A presentation even more apparent, but 
it would not change the ratios on the last two 
retention tests sufficiently to make the dif- 
ference statistically significant. 


The group by the A presentation made the 
greatest net learning gain for the period of 
the experiment. They showed a profit of 
32.75 per cent more of the material at the end 
of the experiment. By the B presentation, 
the group that heard the material over the 
loud speaker showed a profit of 28.35 per 
cent at the end of the experiment. The group 
by the A presentation showed a profit of 
4.40 per cent more than they did by the B 
presentation. 
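
The net learning gains can be reproduced from the Table II means under one reading of the calculation. The sketch assumes the gain is the rise from the pre-test mean to the last retention mean, expressed as a percentage of the last retention mean. That denominator is inferred from the reported figures and is not stated in the text, and the function name `net_gain_percent` is hypothetical.

```python
def net_gain_percent(pre_mean: float, final_mean: float) -> float:
    """Rise from pre-test to final mean as a percentage of the final mean (inferred definition)."""
    return 100.0 * (final_mean - pre_mean) / final_mean


# Pre-test and third retention test means, from Table II.
gain_a = net_gain_percent(37.74, 56.12)  # direct presentation
gain_b = net_gain_percent(39.04, 54.49)  # loud speaker presentation
advantage_of_a = gain_a - gain_b
```

With this definition the gains round to 32.75 and 28.35 per cent, and their difference to 4.40, matching the figures quoted. Dividing by the pre-test mean instead would give much larger percentages, so the choice of base matters when comparing these gains with those of other studies.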

By both the A and B presentations there is 
an increase in the mean score on the third 
retention test over the mean score on the 
second retention test. This two per cent in- 
crease for both groups is probably due to 
reminiscence. It is not due entirely to read- 
ing related material between the two reten- 
tion tests. This is shown in Table III. 

There was no suggestion in the lessons to 
the pupils that they read related information. 
However, it was anticipated that some of 
them might read and thus affect their scores 
on the last two retention tests. Question 501 
appeared in the second retention test and it 
asked pupils to answer in the affirmative if 
they had read related material between the 
lessons and the time of the second retention 
test. Similarly, question 502 on the third re- 
tention test asked pupils to answer in the 
affirmative if they had read related material 
in the interval between the two last retention 
tests. 

Table III makes it possible to study the 
A and B mean differences for the pupils who 
read between the last two retention tests and 
the group as a whole. As can be seen by the 
table, this group averaged higher than the 
group as a whole, of which they were a part. 
However, their superiority is not statistically 
significant. Their superiority is not sufficient 
to explain the increase in the mean score of 
the third retention test over the second re- 
tention test for the group as a whole. There- 
fore, this increase for the whole group may 
be due to reminiscence. 

When it is considered that reminiscence 
might occur in this special group of pupils 
as well as the group as a whole, some of the 
amounts of the scores may be due to this 
phenomenon. The intelligence of the groups 
was approximately the same. The group as 


TABLE III 

MEAN DIFFERENCES BY THE A AND B PRESENTATIONS FOR PUPILS WHO READ BETWEEN THE 
LAST RETENTION TESTS AND THE GROUP AS A WHOLE 

                                                         Standard
Test                 Group                  No.  Mean    Deviation  σm     Diff./σDiff.
Retention Test III   A (Pupils Who Read)    176  58.40   23.50      1.77
                     A (Group as a Whole)   898  56.12   18.37      1.26   1.05
                     B (Pupils Who Read)    176  54.51   19.35      1.46
                     B (Group as a Whole)   898  54.49   19.99      1.37    .01

a whole had an average intelligence quotient 
of 107; this smaller group, 108. The read- 
ing that they did must have supported the 
material learned directly from the speaker 
more than the material learned over the loud 
speaker. 

The group of pupils who reported reading 
between the last two retention tests is smaller 
than the group who reported reading between 
the first two retention tests. Therefore, it 
seems that the question concerning the read- 
ing of related information in the first reten- 
tion test stimulated few pupils to read be- 
tween the last two retention tests. A com- 
parison of the A and B mean scores for those 
who read related information in the interval 
between the first retention test and the second 
retention test with the group as a whole is 
shown in Table IV. 

In three of the four comparisons of means 
this special group made higher averages than 
the group as a whole; however, their superi- 
ority is not statistically significant. It is in- 
teresting to note that for this special group 
their B scores are higher than their A scores, 
which must be due to some factor such as 
reading. The fact that they average below 
the group as a whole on that part of the 
retention test III covering the material pre- 
sented directly, and that the difference be- 
tween the averages is almost equal to three 
times the standard error of the difference, 
seems to indicate that some factor such as 
reading has played a part. It is entirely 
possible that the reading of related infor- 
mation might reduce the scores, as well as 
raise them, since there is considerable liter- 
ature easily available in opposition to the 
facts given in the lessons, as well as support- 
ing such facts. In this regard the reading 
seemed to increase the standard deviation, 
indicating a wider variation within this group 
than the variation in the group as a whole. 


School Marks 

The comparison of mean scores for those 
pupils making high averages and those mak- 
ing low averages in their school marks is 
shown in Table V. 

The marks in the Lincoln schools are on a 
seven point scale. A mark of “1” is highest; 
a mark of “7” is failure. The high mark 
group includes those who have average marks 
of either “1’s” or “2’s”, and in the low mark 
group are those averaging “5’s”, “6’s” and 
“7’s”. The junior high school marks, if 
plotted, would make a skewed curve, as can 
be seen by the number of pupils representing 
each level. The upper group contained 111 
pupils, the low group 40 pupils, and the 
middle group 298 pupils. 

The pre-test scores, covering the material 
presented by the loud speaker, are superior 
for both groups. On all the retention tests 
the mean scores by the A type of presentation 
are superior for both groups. 

The (difference divided by the standard 
deviation of the difference) ratios, shown in 
the column to the extreme right in the table 
indicate that the pupils with high marks show 
a greater difference by the two types of pres- 
entation than do those with low marks, with 
the exception of the last retention test where 
there is a greater difference in the means for 
the pupils with low marks, as indicated by 
their ratio of 1.20. This greater difference of 
the means for the low group on the third 
retention test is possibly due to a difference 
between the groups in reminiscence. For the 
low mark group there is an increase of 2.76 
in the mean score on the third retention test 
over the mean score of the second retention 


TABLE IV 

MEAN DIFFERENCES BY THE A AND B PRESENTATIONS FOR PUPILS WHO READ BETWEEN THE 
FIRST RETENTION TESTS AND THE GROUP AS A WHOLE 

                                                         Standard
Test                 Group                  No.  Mean    Deviation  σm     Diff./σDiff.
Retention Test II    A (Pupils Who Read)    214  55.43   19.50      1.33
                     A (Group as a Whole)   898  55.12   17.72      1.22    .17
                     B (Pupils Who Read)    214  56.55   21.10      1.38
                     B (Group as a Whole)   898  53.57   18.23      1.25   1.60
Retention Test III   A (Pupils Who Read)    214  50.85   18.00      1.23
                     A (Group as a Whole)   898  56.12   18.37      1.26   -2.99
                     B (Pupils Who Read)    214  57.57   21.60      1.48
                     B (Group as a Whole)   898  54.49   19.99      1.37   1.52

TABLE V 

A COMPARISON OF TEST MEANS FOR PUPILS OF HIGH AND LOW SCHOOL MARKS BY 
A AND B PRESENTATIONS 

                                                  Standard
Test                 Group  Level  No.  Mean    Deviation  σm     Diff.         σDiff.        Diff./σDiff.
Pre-test             A      High   222  41.02   17.35      1.16
                     B      High   222  43.73   20.15      1.35   -2.71         1.78          -1.52
                     A      Low    80   32.90   19.90      1.34
                     B      Low    80   33.80   20.15      1.35   -.90          1.90          -.47
Retention Test I     A      High   222  73.39   15.15      1.02
                     B      High   222  64.96   17.00      1.14   8.43          1.53          5.51
                     A      Low    80   46.41   21.05      1.41
                     B      Low    80   42.18   20.35      1.37   4.23          1.97          2.15
Retention Test II    A      High   222  62.82   17.00      1.14
                     B      High   222  61.17   19.30      1.30   1.65          1.78           .95
                     A      Low    80   41.39   23.75      1.59
                     B      Low    80   39.80   21.10      1.42   1.59          2.18           .75
Retention Test III   A      High   222  63.43   17.95      1.20
                     B      High   222  62.67   20.00      1.34   [illegible]   [illegible]   [illegible]
                     A      Low    80   44.15   22.65      1.52
                     B      Low    80   41.59   22.55      1.51   2.56          2.14          1.20


test for the A type of presentation, and only 
1.79 points difference between the mean 
scores of the two retention tests for the B 
type of presentation. 

The net learning gain for the high group is 
33 per cent, as shown by the difference be- 
tween their averages on the pre-test and the 
last retention test for the A type of presenta- 
tion. For the low mark group the net gain 
is 26 per cent, as shown by the difference be- 
tween the averages of the pre-test and the 
last retention test for the A type of presenta- 
tion. According to this interpretation, the 
high mark group profited more than the low 
mark group by having the speaker in their 
presence. 

For the presentation over the loud speaker 
the high mark group showed a profit of 31 
per cent, only two per cent less than the gain 
made by the direct presentation. The low 
mark group by the B type of presentation 
showed a gain of 18 per cent, or eight per 
cent less than by the direct method. 

The forgetting calculated from this table 
corresponds to the findings in the summary 
Table II concerning forgetting. Of what they 
knew of the material presented to them di- 
rectly, the high mark group forgot 14 per 
cent in the interval between the first reten- 
tion test and the last retention test, a period 
of 44 days. Of what they knew of the mate- 
rial presented to them directly, the low mark 
group forgot only three per cent in the in- 


terval between the first retention test and the 
last retention test, a period of 44 days. For 
the material learned over the loud speaker 
the high mark group forgot on the last reten- 
tion test only four per cent of what they knew 
at the end of the first retention test. 

Marks when grouped in this way are a 
selective factor in learning by these types of 
presentation. The pupils with high marks are 
less affected in their learning by the loud 
speaker than the low mark group. However, 
the high mark group forgot a higher percent- 
age of the material learned by the direct 
method of presentation. 


Averages by Schools 

The averages by schools were studied, 
which indicated that the schools differed on 
the amounts they knew before the experiment 
started, and that the general trends as re- 
ported concerning the whole group are present 
in the separate school groups. 


Intelligence Levels Compared 

Do pupils of a higher intelligence quotient 
continue to show their superiority in learning 
from these types of presentation, and how do 
those of different intelligence levels respond 
to the experimental factor? Data on the 
questions are shown in Table VI. The pupils’ 
scores were grouped according to three intelli- 
gence quotient levels. The low group in- 
cluded the scores made by those whose intel- 
ligence quotients were 94 and below. The 
middle group consisted of all the scores
between 95 and 109, and the high intelligence
group those with intelligence quotients of 110
and above.


TABLE VI

MEAN TEST SCORE DIFFERENCES BETWEEN DIFFERENT INTELLIGENCE LEVELS FOR
A AND B PRESENTATIONS

Test            Group  I.Q.   No.   Mean    Standard    σm     Diff.   σDiff.   Diff./
                                            Deviation                           σDiff.
Pre-test          A    Low    156   31.38    17.15     1.37     5.33    1.74     3.06
                       Med.   332   36.71    19.40     1.06     4.29    1.38     3.11
                       High   410   41.00    17.80      .88     9.62    1.63     5.90
                  B    Low    156   29.74    17.35     1.39     8.41    1.74     4.83
                       Med.   332   38.16    19.15     1.05     4.71    1.37     3.44
                       High   410   42.87    17.80      .88    13.13    1.64     8.01
Retention         A    Low    156   47.97    19.15     1.53    11.83    1.79     6.61
Test I                 Med.   332   59.80    16.70      .92    12.53    1.19    10.53
                       High   410   72.33    15.15      .75    24.36    1.70    14.33
                  B    Low    156   40.73    18.65     1.49    11.51    1.73     6.65
                       Med.   332   52.24    16.10      .88    12.09    1.20    10.08
                       High   410   64.33    16.15      .80    23.60    1.26    18.73
Retention         A    Low    156   42.07    19.80     1.59    10.97    1.92     5.71
Test II                Med.   332   53.04    19.45     1.07     8.84    1.37     6.45
                       High   410   61.88    17.30      .85    19.81    1.80    11.01
                  B    Low    156   39.23    21.00     1.68    12.75    1.99     7.59
                       Med.   332   51.98    19.55     1.07     8.33    1.40     5.95
                       High   410   60.31    18.40      .91    21.08    1.91    11.04
Retention         A    Low    156   45.07    20.40     1.63     7.91    1.95     4.06
Test III               Med.   332   52.98    19.75     1.08     9.89    1.39     7.12
                       High   410   62.87    17.90      .88    17.87    1.85     9.66
                  B    Low    156   39.53    21.60     1.73    13.37    2.05     6.52
                       Med.   332   52.90    20.30     1.11     8.57    1.44     5.95
                       High   410   61.47    18.55      .92    21.95    1.96    11.20


In reading the table for the mean differences,
the standard errors, and the (difference
divided by the standard deviation of the
difference) ratios, the first number in each of
these columns is the middle group compared
with the low group. The second number is
the high group compared with the middle
group, and the third number is the high group
compared with the low group.

In every case intelligence is a selective
factor; that is, those higher in intelligence made
higher scores on the tests. The ratios in
every case show the differences are
statistically significant.


Intelligence and Presentations

In Table VII mean scores for the tests
covering material taught by each type of
presentation for each intelligence level are
compared.

On the pre-test the mean of the low group
shows that they did not know as much of the
material presented to them over the loud
speaker as they did of the material presented 
directly with the speaker in their presence. 
In the case of the middle group and the 
upper group the reverse is true. They knew 
more of the material presented over the 
loud speaker. However, in no group is the 
difference between the means statistically 
significant. 
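
The ratios discussed above can be reproduced from the printed means,
standard deviations, and group sizes. The sketch below assumes the
standard error of a mean is the standard deviation divided by the
square root of N, and that the standard error of a difference is the
square root of the sum of the two squared standard errors. The report
does not spell these formulas out, but they agree with the σm and
σDiff. values printed in the tables. Function names are illustrative.

```python
import math


def standard_error_of_mean(sd, n):
    """Standard error of a mean: SD / sqrt(N)."""
    return sd / math.sqrt(n)


def critical_ratio(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Return (difference, standard error of the difference, ratio).

    Assumes the two means are treated as independent, so the standard
    error of the difference is sqrt(se_a**2 + se_b**2).
    """
    se_a = standard_error_of_mean(sd_a, n_a)
    se_b = standard_error_of_mean(sd_b, n_b)
    se_diff = math.sqrt(se_a ** 2 + se_b ** 2)
    diff = mean_a - mean_b
    return diff, se_diff, diff / se_diff


# Low intelligence group on the pre-test: A presentation vs. B presentation.
diff, se_diff, ratio = critical_ratio(31.38, 17.15, 156, 29.74, 17.35, 156)
print(f"{diff:.2f} {se_diff:.2f} {ratio:.2f}")  # 1.64 1.95 0.84
```

A ratio of .84 is well short of the values the report treats as
significant, which is consistent with the statement that no pre-test
difference between presentations is significant.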

On the first retention test the differences 
between the mean scores for each level are 
significant in favor of the direct method of 
presentation. On the last two retention tests 
the mean score differences are not significant, 
as shown by the ratios in the column to the 
extreme right of the table. They came closer 
to being significant on the second retention 
test for the low level group. 

The net learning gain on the A type of
presentation for the low group, calculated
from the pre-test mean to the second
retention test mean, is 30 per cent; for the
middle group the net gain is 31 per cent; and
for the high group the net gain is 35 per cent.
The net learning gain for the B type of pres- 
entation, calculated in the same way for the 
low group, is 24 per cent; for the middle 
group, 26 per cent; and for the high group, 

TABLE VII

TEST SCORES COMPARED FOR EACH INTELLIGENCE QUOTIENT LEVEL
FOR A AND B PRESENTATIONS

Test Group I.Q. No. Mean Standard Deviation σm Diff. σDiff. Diff./σDiff.
Pre-test A Low 156 31.38 17.15 1.37 - , 
B Low 156 29.74 1735 1.39 1:64 1.9 84 
A Med. 332 36.71 1940 106 14, 449 97 
B Med. 332 38.16 19.15 1.05 —_ ss ” 
A High 410 41.00 17.80 88 e ‘ 
B High 410 4287 17.80 '3g 187 1.24 —-1.50 
Retention A Low 156 47.97 1915 153 20, 14 3.40 
feat § 2.3 B Low 156 40.73 1865 149 ‘* 2. 3. 
A Med. 332 59.80 16.70 a om 61st 50s 
B Med. 332 52.24 16.10 .88 rS saa = 
A High 410 72.33 15.15 75 on 
B High 410 64.33 16.15 ‘39 «(800 1.10 7.27 
Retention A Low 156 42.07 19.80 1.59 » 84 » 3) 1.28 
Tek BE aacn B Low 156 39.23 21.00 1.68 i a4 : 
A Med. 332 53.04 19.45 1.07 . - 
B Med. 332 51.98 1955 107 706 151 0 
A High 410 61.88 17.30 85 en ox 9¢ 
B High 410 6031 1840 91 (‘257 1.25 1.26 
Retention —_- A Low 156 45.07 20.40 1.63 r ‘ oC 
Test III _-_- B Low 156 39.53 2160 1.73 54 238 238 
A Med. 332 52.98 19.75 1.08 r - 
B Med. 332 5290 2030 111 ‘8 1.58 ats 
A High 410 6287 17.90  .88 . 0 
B High 410 6147 1855 92 1240 127 1 
35 per cent. The middle and lower groups,
therefore, profited more by the speaker
method than they did by the loud speaker
method. The high group profited about the
same for each method.

In terms of loss, as shown by the difference
between the first retention test and the last
retention test, by the A method of
presentation the low group lost six per cent, the
middle group lost 11 per cent, and the high
group lost 13 per cent. The loss for the B type
of presentation, calculated the same way, for
the low group is three per cent, for the middle
group one per cent, and for the high group
four per cent.

In five out of the six cases the third
retention test scores are higher than the second
retention test scores, again showing that
reminiscence has occurred.


Sex

Table VIII shows the mean scores of the
boys compared with the mean scores of the
girls. It is indicated in every case that the
test averages for the boys are higher than
for the girls. This suggests that possibly the
topic of narcotics has been more of a boy's
problem.

The differences in their averages are
significant on the immediate or first retention
test only. However, on the pre-test, covering
information taught by the direct method, the
boys' superiority reduced to chances is 98 in
100, and on material taught by the B type,
the boys are superior 99 chances in 100.

On the second retention test the boys are 
superior on the material taught by the direct 
method 99 chances in 100, and by the loud 
speaker method 93 chances in 100. 

On the last retention test, covering material 
taught by the A type of presentation, the 
boys are superior by 98 chances in 100, but 
on the test covering the B type of presenta- 
tion the boys are superior only 65 chances 
in 100. 
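
The report does not say how a ratio was "reduced to chances in 100."
A minimal sketch, assuming the usual normal-curve reading of a
(difference divided by standard deviation of the difference) ratio:
the chances are 100 times the area of the normal curve below the
ratio. Under that assumption the pre-test ratios printed in Table VIII
(2.00 and 2.81) come out at about 98 and just over 99 chances in 100,
and the final B-presentation ratio (.40) at about 65, in line with the
figures quoted in the text. The function name is illustrative.

```python
import math


def chances_in_100(ratio):
    """Chances in 100 that the true difference is greater than zero.

    Assumes a normal sampling distribution: 100 * Phi(ratio), where Phi
    is the standard normal cumulative distribution function.
    """
    return 100.0 * 0.5 * (1.0 + math.erf(ratio / math.sqrt(2.0)))


for ratio in (2.00, 2.81, 0.40):
    print(ratio, round(chances_in_100(ratio), 1))
```

A ratio of zero gives exactly 50 chances in 100, that is, no
advantage for either group.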

The odds, then, are definitely in favor of 
the boys in this experiment, if one does not 
allow for their superiority in the pre-test in 
the remainder of the tests. If their pre-test 
superiority is an advantage throughout the 
retention tests, then the net learning gain or 
profit, as shown between the pre-test and 
TABLE VIII


MEAN Test Score DIFFERENCES BY SEX FOR A AND B PRESENTATION 


Test Group Sex No. 
Pre-test _ A Boys 454 
Girls 444 

B Boys 454 

Girls 444 

Retention —— Boys 454 
Test I _ Se Girls 444 
Boys 454 

Girls 444 

Retention _ ; A Boys 454 
Test II Girls 444 
Boys 454 

Girls 444 

Retention Boys 454 
Test III Girls 444 
Boys 454 

Girls 444 


third retention test, is in favor of the girls. 
Making allowances for the advantage of the 
boys on the pre-test, the girls retained more 
than the boys on the material learned over 
the loud speaker, as shown by the last two 
retention test scores by the B method. The 
mean intelligence quotient for the boys and 
the girls was the same. 


Occupational Levels 

The test averages of children of parents, 
usually the father, on high and low occupa- 
tional levels are shown in Table IX. 


Standard Dit 
Mean Deviation om Diff. cDiff. cDif. 
38.82 18.6 87 
36.30 19.15 ‘91 +252 = =1.26 2.00 
40.82 19.15 90 
3722 1915 91 360 128 8 2.81 
65.80 18.35 86 ; 
61.08 18.75 ‘99 «472024 3.81 
58.43 18.55 ‘87 
53.03 18.65 "a9 «(530 1.25 4.24 
56.75 21.65 1.02 
63.56 1885 .89 219 1385 = 2.36 
54.34 2220 1.04 . 
52.30 20.30 "96 «C04 421 
57.45 20.70 97 ; 
54.72 19.40 ‘93 TS 42.08 
54.77 21.70 1.02 
54.20 20.95 ‘99 57 1.42 40 


The high occupational level includes
classifications in the professional group only. The
low occupational level includes those
occupations in the unskilled and semi-skilled groups.
The classification was made from the
occupation of parents listed on the children's school
census sheets according to the occupational
division set up by Arthur J. Jones.⁷ The
averages on the low level are possibly more 
reliable, since this level includes almost twice 
as many subjects as in the high group. Since 


⁷ Arthur J. Jones, Principles of Guidance, p. 314. New
York: McGraw-Hill Book Co., 1930.


TABLE IX


MEAN Test Score DIFFERENCES FOR CHILDREN OF PARENTS ON HIGH AND LOW OCCUPATIONAL 
LEVELS BY A AND B PRESENTATIONS 


Test Group Level No. 
Pre-test A High 72 
B High 72 

A Low 184 

B Low 184 

Retention __-_- es A High 72 
ES anen oan High 72 
A Low 184 

B Low 184 

Retention —__- scans) <a High 72 
Test II __-- ; aoe High 72 
A Low 184 

B Low 184 

Retention -_--- ee High 72 
=e Bee... ane High 72 
A Low 184 

B Low 184 





Standard Diff. 
Mean Deviation om Diff. c¢Diff. Diff. 
40.15 19.35 2.28 
46.15 19.00 224 —800 3.20 —1.88 
36.28 19.00 1.40 . 
3423 1440 106 79 1.76 1.16 
69.84 17.05 2.01 a 
se.66 20.44 24) 118 314 = 3.58 
59.11 19.95 1.47 
60.57 17.7% 181 854 197 4.34 
61.79 18.45 2.17 
57.86 19.00 224 93 312 1.26 
52.97 21.35 1.57 
47.69 4092040 «61.50 428 217 2.48 
63.26 18.35 2.16 
59.86 1865 220 *40 308 1.10 
55.07 20.40 1.50 
ye +0 YS aa 
both these smaller groups were in the whole
group, they were subjected to the same
conditions in the experiment and therefore their
scores can be compared.

There was less difference in the results of
the two methods of presentation for those
children from the high occupational level, as
shown by the (difference divided by the
standard deviation of the difference) ratios. This
is to be expected from our study of
intelligence and test scores, since the intelligence
quotient average for the high occupational
group is 113 and for the low occupational
group is 101. Even allowing for pre-test
advantages, the average scores in every test
were higher for the children from families of
the higher occupational level.

The percentage of loss from the first reten- 
tion test to the last retention test is less in 
this experiment than any of the others. By 
the direct type of presentation the high occu- 
pational group lost nine per cent; the low 
occupational group lost seven per cent. By 
the loud speaker type of presentation the high 
occupational group actually gained two per 
cent, and the low occupational group lost six 
per cent. The gain on the part of the high 
occupational group possibly is due to reminis- 
cence. However, they show a loss on the 
average in that part of the retention test 
which covered the material given them by the 
B or loud speaker type of presentation. 

The third retention test scores in three out 
of four cases are higher than the correspond- 
ing second retention test scores. This in- 
crease may be due to reminiscence. 

The total net learning gain for the high 
group from the pre-test average to the final 
test average by the A method is 37 per cent; 
by the B method, 23 per cent. For the low 
occupational group the net learning gain as
shown between the averages for the A method
is 34 per cent; for the B method, 27 per
cent. The low occupational group, then,
profited more in terms of the percentage
learned from the loud speaker presentation
than did the high occupational group.


TABLE X 


SCHOOL GRADE AVERAGES FOR CHILDREN OF 
PARENTS IN HIGH AND LOW 
OCCUPATIONAL LEVELS 

School Marks 


Occupational Level 1 Ss @$ €8 ¢ 4 
Children from Pro- 


a 319 9 4 1 
Children from Low 
Occupations __-_—-- 1 9 33 35 9 4 1 


Children of parents in professional work 
averaged two points higher on a seven point 
scale than children of the lower groups. The 
relationship of the scores, then, for this group 
is consistent with the group concerning school 
marks. 

On the high occupational level only one 
pupil came from a home without a radio; 
since there were 36 pupils in the group, this 
pupil represents about three per cent of the 
group. In the low occupational group nine 
pupils reported no radio in the home. Five 
of this group did not report. There were 92 
pupils in this group, which means that about 
ten per cent reported no radio in their homes. 

Since the radio group has a lower average 
on tests over material learned from the loud 
speaker, the report on the occupational levels 
is consistent. 

About 65 per cent of the children from the 
high occupational level read material, and 
about the same percentage of the children 


TABLE XI 


MEAN TEST SCORE DIFFERENCES FOR CHILDREN OF FOREIGN BORN PARENTS 
BY A AND B PRESENTATIONS 


Test Group No.

>... ent A 85 

B 86 
Retention —_ = a 86 
(> ety eee B 86 
Retention —_- re ee A 86 
Fee Fe wha chuicteael: Ee 86 
I i Se ee A 86 
Wl cs cc eet aie B 86 


Standard _Diff. 
Mean Deviation om . Diff. cDiff. c¢Diff. 
3339 1945 199 6 38 2.89 
io89 1980 214 «980 -3.06 8.20 
4351 2135 «4230 —%8 386 —08 
1323 «2415 26] 268 844 TB 
from the low occupational level read. If the
reading affected one group, one would expect 
it to affect the other group likewise. 

It is interesting, however, that more than 
one half of this group read related material, 
even though no suggestion to do so was made 
during the experiment. 

None of the foreign born parents is re- 
ported in the high occupational or profes- 
sional group, while 17 per cent of the foreign 
born are reported in the low occupational 
level group. 


Foreign Born Parents 

From the school census sheets the birth 
places of parents were tabulated for those 
pupils taking part in the experiment. For 
this study only the pupils were taken whose 
parents were both reported as being born in 
some country other than the United States. 
Most of these parents were born in Russia. 
A comparison of their means can be found in 
Table XI. 

On the pre-test the mean scores on the ma- 
terial covering each type of presentation are 
practically the same, at least their (difference 
divided by the standard deviation of the dif- 
ference) ratio shows that there is no signifi- 
cant difference. The first retention test aver- 
ages show a decided advantage in favor of the 
A type of presentation; reduced to chances 
this type of presentation is better by 99 
chances in 100. On the third and last reten- 
tion test the means show that the pupils re- 
membered more of what they had learned in 
the speaker’s presence, but they lost some of 
what they had learned as is indicated on the 
first retention test. The material they learned 
over the loud speaker shows a definite gain 
in the last two retention tests over the first 
retention test. In one of the two cases the 
pupils had a higher mean score on the third 
retention test than on the second. The most 
obvious explanation for this is reminiscence. 

In every test the mean scores of these 
pupils are lower than the mean scores of the 
group as a whole, as shown in Table II. 
These pupils have an intelligence quotient 
average of 99, which is below the average of 
the group as a whole. Therefore, the scores 
reported in this table are consistent with the 
intelligence study. They also averaged a 
“4” in their marks on junior high school 
work. This mark is a low average on a seven 
point scale where “1” is perfect and “7” is 
failure. Therefore, the scores of these pupils 
are consistent with the scores in the section
concerning school marks. 

These pupils show a net learning gain of 33
per cent from the time of the pre-test to the
last retention test on the material they
learned by the direct method. This is one
per cent more than the gain of the group as
a whole, which was 32 per cent. Of the
material taught them over the loud speaker, they
show a net learning gain of 32 per cent
between the pre-test and the last retention test.
This is more than the group as a whole
gained, which was 28 per cent. This extra
gain might be due in part to the novelty of
learning over the loud speaker for some of
these children, since thirteen per cent of the
group reported coming from homes without
radios.


No Radios in the Homes 

The group of pupils reported in Table XII,
as not having radios in their homes, averaged 
below the group as a whole on all scores ex- 
cept the pre-test score covering the material 
taught by the direct method. Therefore, 
their averages on succeeding tests may be 
higher for this one method because of their 
knowledge prior to the experiment. Not al- 
lowing for the pre-test advantage, and re- 
ducing the ratios to chances in 100, the 
superiority in favor of the direct method over 
the indirect method is 98 chances in 100 on 
the first retention test, while on the second 
retention test the direct method is superior 
by 84 chances in 100.

Due to some additional loss of the material 
learned by the indirect method and a gain on 
the material learned by the direct method on 
the third retention test, the latter method is 
superior by 93 chances in 100. 

On the retention tests they averaged higher 
on the material learned directly. However, 
their averages indicate that they forgot more 
of the material learned directly, which re- 
duces the difference between the averages on 
the material learned from the two methods. 

Had the (difference divided by the stand- 
ard deviation of the difference) ratio on the 
pre-test been reduced to zero by reducing 
their advantage on the material presented by 
the A type of presentation, undoubtedly the 
differences of the means on the following 
tests, shown by the ratios, would have been 
changed. Assuming that the advantage in 
favor of the direct lesson material on the 
pre-test played a constant part on the
following test scores, the indirect method would be
superior on the second retention test by 65
chances in 100. The direct method would be
superior on the first retention test by 74
chances in 100, and superior on the second
retention test by only 53 chances in 100, thus
making all of the mean differences
statistically insignificant.


The net learning gain on the material 
learned by the A type of presentation is 23 
per cent, which is less than for the group as 
a whole by nine per cent. The net learning 
gain on the material learned from the loud 
speaker presentation is 24 per cent. This is 
four per cent less than for the group as a 
whole. In proportion, then, to the whole 
group these pupils really turned in a better 
performance on the tests covering the ma- 
terial they heard over the loud speaker. It 
is to be expected that the mean scores of this 
group would be below the average of the 
group as a whole, since ten per cent of this 
group came from the low intelligence quotient 
group and sixty per cent came from the 
middle intelligence quotient group. Only 43 
per cent of the group as a whole were in the 
lower and middle intelligence quotient groups. 
Also, more of this group came from the low 
occupational group. 


SUMMARY AND CONCLUSIONS 


This study was concerned with the learning
and retention of factual material presented
over the loud speaker to four hundred
and forty-nine normal eighth grade pupils
compared with their learning and retention
of factual material presented directly by the
speaker himself. The content of the factual
material concerned narcotics.

After a pre-test was given, four lessons
were presented to the pupils on which they
were tested immediately after each lesson and
at two other dates, later by twenty-four and
twenty days, respectively.

In presenting each lesson in each of the
ten schools there were two groups of pupils.
One group heard a lesson directly from the
speaker with the speaker present, and the
other group listened at the same time in
another room over the loud speaker. The four
presentations were rotated between the groups
so that each group heard two talks over the
loud speaker and two directly from the
speaker. The results were combined according
to the types of presentation. The results
of the two types of presentation with the
group as a whole were studied, and in
connection with several other factors common to
the whole group or parts of the group.
Natural school room conditions prevailed
throughout the experiment.

The pupils knew a similar amount of the
material presented to them by the two types,
as shown by the mean scores on the pre-test.
On the immediate retention tests the mean
differences between the amounts learned by
the two types of presentation are significant
in favor of the direct type of presentation,
that is, having the material presented to the
pupils with the speaker present. However,
on the last two retention tests the mean
differences indicate that the two types of
presentation are almost equally effective. At
least, the (difference divided by the standard
deviation of the difference) ratios indicate
that the differences are not significant.

The pupils made the greatest net learning
gain on the material presented to them
directly. While there was forgetting of
TABLE XII 


MEAN TEsT SCORE DIFFERENCES FOR PUPILS From HOMES WITHOUT RADIOS 
BY A AND B PRESENTATIONS 


Test Group No. 
Pre-test a cesdiibsig Glad tc‘ catco tenis oediin a 60 
B 60 

| VR = pa aa oe a a A 60 
. ke ae eee B 60 
Retention - < = one 60 
. § eee eee oleae B 60 
Retention .............. el ois = ae 60 
Test III ___- ae es 60 





Standard _ Diff. 
Mean Deviation om . Diff. cDiff. coDiff. 
3600 1715 208 473 380° 143 
5242 1685 «217 «855 SMB 2.09 
4913 1735 «224 872 «362 1.02 
i707 2295 «287 888-8921. 
the amounts learned by both types of
presentation, a higher percentage of the 
material learned over the loud speaker was 
remembered. 

In almost every case, apparently due to 
reminiscence, the last retention test scores 
were higher than the next to the last reten- 
tion test scores. 

Minor studies connected with this problem 
are as follows: 

1. Even though no suggestion was made to 
the pupils that they read related material, 
some reported that they did read in the in- 
tervals between the retention tests. Their 
reading had a tendency to raise their reten- 
tion test scores above the average of the 
group, but not sufficiently to make their 
scores significantly higher than the group as 
a whole, and not a sufficient amount to ex- 
plain away reminiscence. 

2. Pupils with high average marks in their 
regular junior high school work learned more 
by both types of presentation than those with 
the low average school marks. The results 
from this group demonstrate again that while 
pupils remember slightly more by having the 
speaker present, they forget a smaller per- 
centage of what they learned over the loud 
speaker. 

3. Intelligence scores are a selective factor 
in learning by both types of presentation. 
Pupils of high intelligence learn more by 
both methods than the middle and lower in- 
telligence groups. The net learning gain for 
those of higher intelligence is about the same 
for both types of presentation. The middle 
and lower groups retained more by having the 
speaker present. 

4. The test scores for the boys are higher 
than for the girls by each type of presenta- 
tion. The average intelligence for the boys 
as a group was about the same as for the 
girls as a group. The pre-test mean scores 
indicate that the boys knew more at the be- 
ginning of the experiment. There was less 
difference between the two groups as the 
tests continued. Making allowance for the 
advantage the boys had at the beginning of 
the experiment, the girls’ scores are about 
the same as the boys’. The girls showed 
some superiority in learning by the loud 
speaker method of presentation. 
5. The mean test scores of the group of
pupils whose parents, mainly the father, were
classed in the professional occupations were
compared with the mean scores of those
whose parents were from the unskilled and
semi-skilled occupations. The pupils in the
high occupational group learned more by the 
direct method than the pupils from the low 
occupational group. Those in the low occu- 
pational group profited more from the loud 
speaker type of presentation. It is interest- 
ing to note that the high occupational group 
averaged higher in intelligence, and also aver- 
aged two points higher in their school marks 
on the seven point scale. About the same 
percentage of each of these groups read
related material between the retention tests.
About ten per cent of the low occupational 
group reported having no radios in their 
home. Learning from the loud speaker, 
therefore, for that group may have been 
somewhat of a novelty. 

6. The learning, by the two types of
presentation, of pupils whose parents were born
outside of the United States was studied and
compared with the group as a whole. Their
test scores were lower than the group as a
whole, and their net learning gain for the
time of the experiment was more than that
for the group as a whole by both methods.
This might be due in part to the novelty of
the loud speaker presentation, because
thirteen per cent of this group reported not
having radios in their homes. On the whole, this
group profited slightly more by having the
speaker present, but the differences are
statistically insignificant. It is of interest to know
that this group was below the average of the
group as a whole in intelligence.

7. The pupils who reported that they did 
not have radios in their homes profited 
slightly more by the loud speaker presenta- 
tion for the time of the experiment. On the 
tests they averaged below the whole group by 
both methods, which is expected, since they 
averaged below the group as a whole in in- 
telligence. In school marks they averaged 
lower than the group as a whole, and a higher 
percentage came from the low occupational 
group. In general, it seems that these pupils 
were not handicapped by learning over the 
loud speaker. 











RETROACTIVE INHIBITION AS A FUNCTION OF AGE, 
INTELLIGENCE, AND THE DURATION OF THE 
INTERPOLATED ACTIVITY¹

SISTER M. FLORENCE LOUISE LAHEY


Marygrove College 
Detroit 


The vast amount of recent experimental
research on retroactive inhibition² has proved
that degree of retention is largely a function
of the activities intervening between learning
and recall. The disuse hypothesis, therefore,
has been discarded as a complete explanation
of forgetting, and in its place has been
substituted a theory based upon the inhibitory
influence that the learning of an interpolated
activity has upon the retention of previously
acquired material. Experimental
investigations have shown that the amount of
inhibition is subject to many conditions involved in
and surrounding the learning situation.
Previous studies of the phenomenon have been
concerned chiefly with such factors as the
similarity of the original and interpolated
learning, the temporal position of the
interpolation, the amount and degree of learning
of the original material, the amount and
degree of learning of the interpolated activity,
the kind of material used for original
learning, attitudes and mental sets, emotional
disturbances before or during the interpolated
study, and method of measurement.

¹ From the Department of Education, Catholic University,
Washington, D. C.

² For a comprehensive review of studies of retroactive
inhibition see S. H. Britt, "Retroactive Inhibition: A Review
of the Literature," Psychological Bulletin, XXXII (1935).

³ Sr. M. Florence Louise Lahey, Retroactive Inhibition as a
Function of Age, Intelligence, and the Duration of the
Interpolated Activity. Educational Research Monographs, Vol.
[illegible]. Washington: Catholic University of America,
[illegible]. P. 93.

The present study³ was one of a number of
experimental investigations on retroactive
inhibition conducted in educational psychology
at the Catholic University of America. A
serial list of twenty-five verbs, taken from the
first twenty-five hundred words of the
Thorndike Word List, was studied for five minutes
by 3,434 children from the third through
the tenth grades of several Detroit schools.
By providing an age range of from eight
through sixteen years, this procedure
rendered possible comparisons of nine different
age-levels. Immediately after a four
minute test of the original material, the ex- 
perimental groups devoted varying amounts 
of time to the study of an interpolated list 
of fifty nouns. The periods of interpolation 
employed were four, six, eight, ten, twelve, 
and fourteen minutes; each study period was 
followed by a three minute test of the inter- 
polated series. In every condition of the ex- 
periment a fixed interval of seventeen minutes 
was allowed between the end of the initial 
test and the recall of the original list of verbs. 
All groups devoted the interval between the 
end of the designated period of interpolated 
study and this recall to the group singing of 
songs well known and apparently enjoyed by 
all. This activity was chosen because it is 
one that approximates a group “rest” interval. 
For the various experimental conditions the 
duration of this “rest” activity varied from 
zero to ten minutes. To provide a common 
basis for the comparison of results, a control 
group was used. For this group there was 
no interpolated study; the entire seventeen 
minute interval between the initial test and 
the first recall was devoted to the same type 
of group singing that was employed with the 
experimental groups. In order to check the 
persistence of the retroactive effect, a second 
recall was requested of all groups at the end 
of twenty-four hours. In the interpretation 
of the results derived from this procedure, 
three problems were placed for consideration: 

1. Retroactive inhibition as a function of 
the length of time devoted to interpolated 
study. 

2. The importance of age as a factor in 
retroaction. 

3. The relationship between degree of 
mental ability and susceptibility to the detri- 
mental influence of interpolated study. 

All results given in this summary of the 
experimental procedure described are based 
upon data obtained from groups equated on 
TABLE I

PERCENTAGES OF RETENTION AND OF RETROACTIVE INHIBITION
SEVENTEEN-MINUTE INTERVAL

                   8 yrs.  9 yrs.  10 yrs.  11 yrs.  12 yrs.  13 yrs.  14 yrs.  15 yrs.  16 yrs.
4-Min. Interp.
  Control           78.8    86.8    94.7     90.4     89.8     90.1     89.6     91.2     84.9
  Exper.            56.3    56.4    59.6     56.4     56.0     57.7     59.2     65.6     78.7
  R. I.             28.6    35.0    37.1     37.6     37.6     36.0     33.9     28.1      7.3
6-Min. Interp.
  Control                   90.6    97.6     92.3     90.0     89.4     89.3     89.9     85.6
  Exper.                    55.4    48.0     47.6     58.7     59.5     62.7     65.0     65.1
  R. I.                     38.9    50.8     48.4     34.8     33.4     29.8     27.7     23.9
8-Min. Interp.
  Control           89.7    90.3    98.9     88.4     91.4     89.9     89.9     91.5     85.8
  Exper.            50.4    54.3    48.8     54.3     51.3     57.9     58.4     62.2     70.1
  R. I.             43.8    39.9    50.7     38.6     43.9     35.6     35.0     32.0     18.3
10-Min. Interp.
  Control           83.1    88.3    94.7     89.0     92.1     89.9     90.4     86.8     85.8
  Exper.            44.2    47.9    47.6     51.4     48.3     50.6     48.1     59.4     55.7
  R. I.             46.8    45.8    49.7     42.2     47.6     43.7     46.8     31.6     35.1
12-Min. Interp.
  Control           85.0    90.1    94.3     90.2     90.6     89.4     90.7     87.8     84.9
  Exper.            41.5    52.2    47.7     44.9     48.3     53.8     58.3     57.5     63.1
  R. I.             51.2    42.1    49.4     50.2     46.7     39.8     35.7     34.5     25.7
14-Min. Interp.
  Control           91.6    87.0    98.3     91.0     90.6     90.0     90.5     89.0
  Exper.            41.2    49.6    41.7     40.1     50.2     47.7     51.8     56.9
  R. I.             55.0    43.0    57.6     55.9     44.6     47.0     42.8     36.1




the basis of the mean score on the initial 
test, measures of variability being ignored. 
The percentage of retroactive inhibition was
obtained from the formula:

$$
\%\ \text{R. I.} = \frac{\%\ \text{retention}_{\text{control}} - \%\ \text{retention}_{\text{experimental}}}{\%\ \text{retention}_{\text{control}}} \times 100
$$
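
The computation of the percentage of retroactive inhibition may be illustrated by a minimal sketch. The function name and its arguments are illustrative assumptions and not part of the original procedure; the two input figures are the control and experimental retention percentages of the eight-year group at the four-minute interpolation in Table I.

```python
def retroactive_inhibition(control_retention: float,
                           experimental_retention: float) -> float:
    """Return the percentage of retroactive inhibition.

    Both arguments are percentages of retention (0-100). The name and
    signature are hypothetical; only the ratio comes from the text.
    """
    if control_retention <= 0:
        raise ValueError("control retention must be positive")
    loss = control_retention - experimental_retention
    return loss / control_retention * 100.0


# Eight-year group, four-minute interpolation, seventeen-minute recall.
ri_8_years = round(retroactive_inhibition(78.8, 56.3), 1)
print(ri_8_years)
```

Rounded to one decimal place, the result agrees with the R. I. entry of 28.6 reported for that group.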

As shown in Tables I and II, there is a
consistent tendency, under the conditions of
this experiment, for the amount of retention
to decrease and for the percentage of
retroactive inhibition to increase with the
augmentation of the time devoted to the study
of an interpolated list twice as long as the
original series.

Though not so reliable a measure of inhibition
as that using the inhibitory effect manifested
by individual groups, the method of averaging
the amounts of retroaction evidenced by all
age-groups at each of the interpolation periods
offers a convenient summary of the varying
degrees of interference produced by the
different amounts of interpolated study. The
averages for the various periods are as
indicated in Table III.


These data indicate the gradual and regular 
increase in the percentage of the retroactive 
influence from the four to the fourteen-minute 
interpolation at the seventeen-minute recall 
and from the four to the ten, twelve, and 
fourteen-minute periods at the twenty-four 
hour recall. According to every method of 
comparison, the results indicate that the 
greatest single amount of inhibition is pro- 
duced by the first four minutes of inter- 
polated study. As interpolation time extends 
beyond four minutes, however, there is a 
gradual increase in the inhibitory effect, 
though relatively small from one interpolation
period to the next. When the ten-minute
interval is reached, there is a decided
increase in the retroactive effect. While
evident in the ratios, the increment is not so
TABLE II

PERCENTAGES OF RETENTION AND OF RETROACTIVE INHIBITION
TWENTY-FOUR HOUR RECALL

                   8 yrs.  9 yrs. 10 yrs. 11 yrs. 12 yrs. 13 yrs. 14 yrs. 15 yrs. 16 yrs.
4-Min. Interp.
  Control            77.2    73.6    76.1    81.4    76.8    80.9    80.9    85.1    73.3
  Exper.             44.5    43.8    43.6    44.1    48.0    50.5    56.4    59.2    58.3
  R. I.              42.4    40.5    42.7    45.8    37.5    37.6    30.3    30.4    20.5
6-Min. Interp.
  Control            76.9    80.0    82.7    76.8    71.5    78.7    82.8    76.1
  Exper.             47.2    41.8    43.6    47.3    48.4    51.5    52.3    54.6
  R. I.              38.6    47.8    47.3    38.4    32.3    34.6    36.8    28.3
8-Min. Interp.
  Control            84.8    76.9    81.8    79.7    78.8    80.3    79.5    85.0    76.3
  Exper.             33.3    48.7    44.6    52.8    46.8    48.3    49.8    54.5    55.1
  R. I.              60.7    36.7    45.5    33.8    40.6    39.9    37.4    35.9    27.8
10-Min. Interp.
  Control            80.8    73.2    76.1    80.8    81.5    80.2    82.1    80.7    68.4
  Exper.             26.0    36.3    38.8    39.5    46.9    47.2    43.9    49.1    50.9
  R. I.              67.8    50.4    49.0    50.6    42.5    41.1    46.5    39.2    25.6
12-Min. Interp.
  Control            82.1    77.8    74.4    81.6    79.1    79.1    81.7    81.5    73.3
  Exper.             30.9    33.1    39.5    35.9    48.3    48.0    53.4    55.3    44.6
  R. I.              62.4    57.5    46.9    56.0    38.9    39.3    34.6    32.1    39.2
14-Min. Interp.
  Control            86.4    75.1    80.1    81.9    79.1    80.4    81.7    81.8
  Exper.             35.1    40.7    38.8    39.8    44.9    49.1    50.2    50.5
  R. I.              59.4    45.8    51.6    51.4    43.2    38.9    38.6    38.3

pronounced from the ten through the
fourteen-minute intervals, especially in the
delayed recall. This apparent plateau in the
detrimental influence after ten minutes of
interpolated study is probably due to the
relation between intelligence and retroactive
inhibition shown in the latter part of this
summary.

Tables I and II contain the data used in 
the preliminary study of the influence of age 
upon retroaction. On first analysis, these 
data seemed to indicate that the disturbing 


TABLE III

AVERAGE PERCENTAGES OF RETROACTIVE INHIBITION (ALL AGES COMBINED)

                              17-Minute Recall      24-Hour Recall
                              % R.I.    Number      % R.I.    Number
4-Minute Interpolation         31.2       395        36.4       395
  Control                                 713                   713
6-Minute Interpolation         36.0       394        38.0       394
  Control                                 624                   624
8-Minute Interpolation         37.5       450        39.8       450
  Control                                 663                   663
10-Minute Interpolation        43.3       497        45.9       497
  Control                                 668                   668
12-Minute Interpolation        41.7       436        45.2       436
  Control                                 654                   654
14-Minute Interpolation        47.8       492        45.9       492
  Control                                 667                   667

effect of interpolated material decreases with
increasing age. In the study of the influence 
of intelligence upon the retroactive effect, 
however, the results proved that it is not age 
alone, but age in relation to intelligence that 
conditions the degree of retroaction in 
children. 

To determine the relation between intelli- 
gence and susceptibility to retroaction, the 
subjects were classified as bright, average, or 
dull on the basis of IQ’s obtained from the 
administration of the Otis Group Intelligence 
Test. Because of the limited number of 
cases in the different age and interpolation 
groups, it was decided to depart from the 
usual classification and to use 94 as the 
maximum IQ for the dull groups and to des- 
ignate all attaining an IQ above 105 as bright. 
The average groups were thus included in the 
range 95-105 inclusively. Though such 
coarse grouping would be inadvisable for in- 
dividuals, it was probably sufficiently reliable 
for the purposes of this study. 
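
The grouping just described can be stated compactly. The following is a minimal sketch only; the function name is an illustrative assumption, while the cut-off points (94 as the maximum for the dull, 95 to 105 inclusive for the average, above 105 for the bright) are those given in the text.

```python
def classify_brightness(iq: int) -> str:
    """Assign an IQ to the coarse brightness groups used in this study.

    Hypothetical helper: the boundaries follow the text, the name does not.
    """
    if iq <= 94:
        return "dull"
    if iq <= 105:
        return "average"
    return "bright"


# Boundary cases on either side of each cut-off point.
labels = [classify_brightness(iq) for iq in (94, 95, 105, 106)]
print(labels)
```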

To provide for a representative number of 
cases in the various age-groups, adjoining age 
levels were combined. This procedure gave 
combinations of the eight and nine, ten and 
eleven, twelve and thirteen, and fourteen and 
fifteen year levels. This combination into 
four age levels rendered possible at each recall 
seventy-two comparisons with the twelve con- 
trol groups, 144 in all. It is on the basis of 
the consistency of the results of this large 
number of comparisons that the general trend 
concerning the susceptibility of different de- 
grees of intelligence to retroaction assumes 
considerable reliability. 
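
The count of comparisons can be verified by enumeration. This sketch assumes that one comparison corresponds to one combination of age level, interpolation period, and brightness group at a given recall; the variable names are illustrative and not taken from the original.

```python
from itertools import product

AGE_LEVELS = ["8-9", "10-11", "12-13", "14-15"]
INTERPOLATION_MINUTES = [4, 6, 8, 10, 12, 14]
BRIGHTNESS_GROUPS = ["bright", "average", "dull"]
RECALLS = ["17-minute", "24-hour"]

# One control group for each age level and brightness group.
control_groups = list(product(AGE_LEVELS, BRIGHTNESS_GROUPS))

# One experimental comparison per age level, interpolation period, and
# brightness group at each recall.
comparisons_per_recall = list(
    product(AGE_LEVELS, INTERPOLATION_MINUTES, BRIGHTNESS_GROUPS)
)
total_comparisons = len(comparisons_per_recall) * len(RECALLS)

print(len(control_groups), len(comparisons_per_recall), total_comparisons)
```

Under that assumption the enumeration yields the twelve control groups, the seventy-two comparisons at each recall, and the 144 comparisons in all that are stated above.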

In some of the conditions of this part of 
the procedure the numbers appear small, par- 
ticularly in certain of the dull groups. This 
is not because of insufficient subjects for the 
investigation as a whole, but because so many 
interpolation times were used that the cases 
were distributed over six different conditions 
and among four age-groups. In spite of the 
small sampling at the extreme age levels, 
however, the results are consistently in the 
same direction. 

Scores on the initial test tended to vary
with level of brightness. At both recalls,
with all age levels, and in every period of in- 
terpolation the absolute losses sustained by 
the experimental groups were considerably 
greater than those of the control groups of 
the same degree of intelligence. Though 
there were appreciable differences in the
percentages of retention of the bright, average,
and dull, these were found chiefly in the ex- 
perimental groups, the control groups differ- 
ing but slightly in the percentage of words 
recalled. The differences in memory, cor- 
responding to level of brightness, of the con- 
trol groups were in the amounts learned 
rather than in the proportion of words re- 
called. In the experimental groups, with few 
exceptions, the percentage of retention of the 
average groups was greater than that of the 
dull; and, in every case, the retention of the 
bright surpassed that of both the average and 
the dull, in most instances by an appreciable 
amount. 

To provide a single measure of retroaction 
for the bright, average, and dull levels, the 
percentages of inhibition for the four age 
levels were averaged. Table IV gives these 
averages at the seventeen-minute recall. 
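
The averaging step may be illustrated as follows. This is a sketch under the assumption that each entry is the simple unweighted mean of the four age-level percentages; the inputs are the four-minute, seventeen-minute-recall figures for the four age levels as reported in Table VI, and the variable names are illustrative.

```python
from statistics import mean

# Four-minute interpolation, seventeen-minute recall; one value per age
# level (8-9, 10-11, 12-13, 14-15 years), as reported in Table VI.
FOUR_MINUTE_RI = {
    "bright": [25.9, 28.8, 28.9, 24.0],
    "average": [32.6, 40.0, 37.7, 29.0],
    "dull": [37.5, 41.7, 42.2, 35.5],
}

# Unweighted mean over the four age levels, rounded to one decimal.
averages = {
    level: round(mean(values), 1) for level, values in FOUR_MINUTE_RI.items()
}
print(averages)
```

The three means agree with the four-minute row of Table IV (26.9, 34.8, and 39.2), which suggests that the group sizes were not used as weights.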


TABLE IV

AVERAGE PERCENTAGES OF RETROACTIVE INHIBITION FOR ALL AGES
SEVENTEEN-MINUTE RECALL

                              Bright       Average      Dull
IQ                            Above 105    95-105       Below 95

4-Minute Interpolation         26.9         34.8         39.2
6-Minute Interpolation         29.3         38.0         44.6
8-Minute Interpolation         34.0         40.0         42.5
10-Minute Interpolation        33.8         45.7         53.5
12-Minute Interpolation        34.1         46.7         49.3
14-Minute Interpolation        32.3         53.4         57.5
Mean                           31.7         43.1         47.8


These data indicate clearly that, at the 
seventeen minute recall, the amount of inter- 
ference is consistently greater for the average 
than for the bright, and greater for the dull
than for the average. In each of the three 
ability groups the percentage of retroaction 
increases as the length of time given to inter- 
polated study is extended. This tendency is 
especially evident among the average, there 
being no exception at this level. In spite of 
some irregularities, the amount of interfer- 
ence evidenced by the dull increases consid- 
erably as the interpolation periods increase 
in duration. Among the bright there is an 
increment in the percentage of inhibition as 
interpolated study increases from four to six
and from six to eight minutes, but thereafter 
the figures remain approximately constant. 
In addition to the consistent tendency for 
retroactive inhibition to vary with brightness, 
it is interesting to note that, at this recall, 
extended interpolated study does not produce 
the additional interference among the bright 
that it does among the average and the dull. 


In the twenty-four hour recall practically 
the same varying detrimental effects prevail. 


TABLE V

AVERAGE PERCENTAGES OF RETROACTIVE INHIBITION FOR ALL AGES
TWENTY-FOUR-HOUR RECALL

                              Bright       Average      Dull
IQ                            Above 105    95-105       Below 95

4-Minute Interpolation         32.0         38.8         44.8
6-Minute Interpolation         33.9         40.4         45.4
8-Minute Interpolation         34.8         41.9         43.1
10-Minute Interpolation        38.6         52.3         57.7
12-Minute Interpolation        32.6         51.5         54.4
14-Minute Interpolation        28.0         51.0         60.7
Mean                           33.3         46.0         51.0

At this interval, the average amounts of
inhibition show that extended interpolation up
to ten minutes increases the amount of
retroactive inhibition in the bright groups.
Extending interpolated study beyond ten
minutes, however, appears to decrease the
retroactive effect at this level. In delayed
recall retroaction also increases through ten
minutes of interpolation with those of average
brightness, though thereafter susceptibility to
the disintegrating influence of intervening
activity appears to remain almost constant.
With the dull, however, allowing for some
inconsistencies, the interference increases
consistently from one interpolation to the next
through fourteen minutes, the longest period of
interpolated study employed in this study.

The general trend found by averaging the
amounts of retroaction is also indicated in the
percentages of the detrimental influence
found in the individual age-groups. Though
at each age level there are exceptions to this
general tendency, in most instances they are
statistically insignificant. Table VI gives the
data for the seventeen-minute recall.

These data show that, at the eight and
nine year level, in five of the six conditions
the bright groups are affected less by the
interference effect of the interpolated material
than the average; and that, in every instance,
TABLE VI

PERCENTAGES OF RETROACTIVE INHIBITION OF BRIGHT, AVERAGE, AND DULL GROUPS
SEVENTEEN-MINUTE INTERVAL

C. A.            8 and 9 Years    |    10 and 11 Years   |    12 and 13 Years   |    14 and 15 Years
              Bright  Aver.   Dull | Bright  Aver.   Dull | Bright  Aver.   Dull | Bright  Aver.   Dull
4-Min. Int.     25.9   32.6   37.5 |   28.8   40.0   41.7 |   28.9   37.7   42.2 |   24.0   29.0   35.5
  No.             39     20     10 |     38            22 |     36
6-Min. Int.     33.3   37.5   43.1 |   35.3   52.3   57.6 |   27.4   35.6   41.0 |   21.3   26.7   36.6
  No.                    26     10 |     22     39     36
8-Min. Int.     35.5   39.5   47.5 |   36.8   47.4   45.2 |   36.1   39.8   40.5 |   27.7   33.3   36.7
  No.             39     22      5 |     40     40     20 |     42     40     29 |     66     45     24
10-Min. Int.    36.8   41.5   64.5 |   31.5   46.9   63.2 |   36.8   49.4   46.2 |   30.4   45.0   40.0
  No.             49     32      7 |     36     58
12-Min. Int.    39.6   38.6   56.4 |   37.9   58.3   57.1 |   38.1   47.7   42.7 |   20.8   42.2   41.1
  No.
14-Min. Int.    36.7   53.7   52.0 |   41.1   61.4   67.0 |   24.7   49.7   65.9 |   26.5   48.9   45.0
  No.                    29     11 |     59     63        |     58     60     16 |     34     44     11
Mean            34.6   40.6   50.2 |   35.2   51.1   55.3 |   32.0   43.3   46.4 |   25.1   37.5   39.2
Control No.       64     45     16 |     72     68     45 |     96     81     45 |     87     48     23
the dull suffer from the disintegrating
influence of interpolation more than either the
bright or the average. At the ten and eleven 
year level the same tendency appears, and, 
with two statistically insignificant exceptions, 
retroaction increases consistently with de- 
creasing ability. The consistency of this tend- 
ency indicates the reliability of the conclu- 
sion that, at these age levels, there is an in- 
verse relationship between degree of bright- 
ness and the percentage of retroactive inhibi- 
tion that is produced by a similar serial list 
upon an antecedently learned list. 

At the twelve and thirteen year level the 
difference between the average and the dull 
groups at the eight-minute interpolation is 
insignificant, while at the ten and twelve- 
minute interpolations it is actually in favor of 
the dull groups. Since the phenomenon 
seems to vary in its influence with individuals, 
the effect is evidenced with only certain of 
the subjects and with these in varying de- 
grees. It is possible, therefore, that indi- 
vidual differences in susceptibility to the dis- 
turbing influence of intervening conditions 
have produced the fluctuations which appear 
to center at these ages. At the fourteen and
fifteen year level, in the ten and
fourteen-minute interpolations, the degree of
retroaction is less with the dull than with the
average groups, and at the twelve-minute
interpolation the difference is insignificant.
Here also, the exceptions may be attributed to
individual differences in susceptibility to the
disintegrating effect of interpolated activity.
Since the numbers in the groups in which the 
dull appear to surpass the average are smaller 
than in most of the conditions, the inconsis- 
tencies may be due, in part at least, to in- 
adequate sampling. 

It is particularly noteworthy that there is 
only one instance (the twelve-minute inter- 
polation of the eight and nine year level) in 
which the retroactive influence in the bright 
groups exceeds that produced in the average 
and the dull. Equally as significant is the 
fact that at the seventeen-minute recall, 
though there are twenty-four conditions for 
comparison, there is only one case in which 
the percentage of inhibition of the bright 
groups (the fourteen-minute interpolation of
the ten and eleven year age-level) reaches or 
exceeds the 40 per cent point, while, except in 
the shorter interpolation periods, the percent- 
ages of inhibition in the average groups are 
ordinarily much above 40 per cent. With 
the dull, except in the fourteen and fifteen 
year groups where there is a decided drop in 
the retroactive effect at all levels of ability,
and in the four-minute interpolation of the 


TABLE VII

PERCENTAGES OF RETROACTIVE INHIBITION OF BRIGHT, AVERAGE, AND DULL GROUPS
TWENTY-FOUR HOUR RECALL

C. A.            8 and 9 Years    |    10 and 11 Years   |    12 and 13 Years   |    14 and 15 Years
              Bright  Aver.   Dull | Bright  Aver.   Dull | Bright  Aver.   Dull | Bright  Aver.   Dull
4-Min. Int.     36.0   38.7   46.1 |   38.6   44.6   50.3 |   27.1   39.9   45.6 |   26.2   32.1   37.0
Average I.Q.     109    101     90 |    111    100     89 |    111    101     88 |    113    100     81
6-Min. Int.     36.7   37.7   48.7 |   37.9   48.0   55.0 |   30.1   37.6   48.1 |   30.9   38.4   39.8
Average I.Q.     112    101     90 |    111    100     86 |    112    100     88 |    114    101     87
8-Min. Int.     44.9   46.5   51.3 |   30.1   42.1   35.8 |   33.8   41.9   44.3 |   30.2   36.9   41.1
Average I.Q.     114    100     87 |    113    101     85 |    112    101     85 |    113    100     89
10-Min. Int.    48.5   56.0   72.7 |   38.4   53.1   65.1 |   32.5   51.3   47.7 |   34.8   48.9   46.1
Average I.Q.     113    101     89 |    111    101     87 |    113    100     91 |    113    101     87
12-Min. Int.    46.5   64.2   73.1 |   32.0   55.5   62.7 |   34.0   45.6   37.0 |   17.8   40.7   44.8
Average I.Q.     115    102     92 |    113    101     90 |    113    101     89 |    116    101     86
14-Min. Int.    39.7   53.1   65.0 |   32.0   57.4   63.6 |   17.0   49.0   62.8 |   23.2   44.3   51.2
Average I.Q.     115    103     91 |    114    101     88 |    114    101     90 |    115    102     87
Mean            42.1   49.4   58.7 |   34.0   50.1   55.4 |   29.1   44.2   46.8 |   27.2   40.2   43.3
Control
Average I.Q.     111    101     91 |    112    100     87 |    112    101     86 |    114    101     88
eight and nine year level, the amount of
inhibition is considerably greater, varying from
40.5 per cent to 67.0 per cent.

When the individual conditions of the
twenty-four hour recall are considered (Table
VII), in nearly every comparison, though in
some cases the differences are small, there
appears a tendency for the percentage of
inhibition to decrease in the order of dull,
average, bright.

With these data as a basis for judging, it
is evident that there can be but one
conclusion, viz., that, under the conditions of this
experiment, susceptibility to retroactive in- 
hibition decreases with increasing degrees of 
brightness. When it is considered that one 
hundred forty-four comparisons were made 
possible by the conditions of the experiment, 
the consistency of the results argues for a 
high degree of reliability in the data. It is 
significant that most of the deviations from 
the general tendency occur in the two higher 
age levels where inconsistencies are also noted 
in the influence of the duration of the inter- 
polated activity upon the degree of inhibition. 
With one exception, all deviations occur in 
the direction of the average and the dull 
groups. Only in the case of the twelve-minute
interpolation of the eight and nine
year level is there an exception among the
bright groups. In this one case the amount
of the difference in favor of the average group 
is so small as to be insignificant. In every 
other comparison the amount of retroaction 
is less among the bright than among the 
average. Allowing for some inconsistencies, 
there is also less inhibition among the average 
than among the dull. The difference in the 
inhibitory effect between the bright and the 
dull groups is decidedly significant in every 
condition. It is evident, therefore, that there 
is a definite inverse relationship between sus- 
ceptibility to retroactive inhibition and degree 
of brightness. 

From the consistency of the results ob- 
tained under the conditions of this experi- 
ment, the following conclusions appear 
justified. 

1. Retroactive inhibition in children is a 
function of degree of intelligence and of the 
length of time devoted to interpolated study. 
2. Chronological age being constant,
susceptibility to the retroactive influence varies
with level of brightness. The percentage of
inhibition decreases as degree of brightness
increases.

3. As indicated by the performance of the 
groups that had only four minutes of inter- 
polation, the retroactive effect is at its maxi- 
mum rate per minute during a comparatively 
short interval immediately following the 
original learning. The detrimental influence, 
however, increases slowly but regularly as the 
duration of the interpolated activity is
augmented. This tendency is evident chiefly in
the degree of retroaction of the average and 
dull groups. 

4. Among the bright there is a tendency 
for the average percentage of retroactive in- 
hibition to increase as the duration of the 
interpolated study is extended from four to 
eight or ten minutes; but thereafter the in- 
hibitory effect remains practically constant. 

5. In delayed recall, with average and 
bright groups, there appears to be an optimal 
interval beyond which increase in the dura- 
tion of the period of interpolation produces no 
further increment in the retroactive influence. 
The same trend is evident, to a lesser degree, 
in the dull groups. With the bright there is 
a marked tendency for the degree of
inhibition to decrease as the period of interpolation
extends beyond ten minutes. 


6. With brightness held constant, suscepti- 
bility to retroaction decreases with increasing 
chronological and/or mental age, the results 
at the eight and nine year level constituting 
an exception that may be attributed to small 
sampling and to the low scores attained on 
the initial test. 

7. Since the bright, average, and dull con- 
trol groups differ in original learning scores, 
but not appreciably in comparative retention, 
variability in the retroactive effect may be 
traced almost entirely to the influence of the 
interpolated activity. 

8. The degree of retroactive inhibition 
varies considerably with individuals of all age 
levels and of all degrees of brightness, 
though not sufficiently to conceal strong group 
tendencies. 











THE CONDITIONED REFLEX TECHNIQUE APPLIED TO A 
LESS SPECIALIZED TYPE OF LEARNING 


MERRILL T. EATON


Indiana University 


Much has been written concerning the con- 
ditioned reflex, and numerous extensive ex- 
perimental investigations have been con- 
ducted in order to determine the methods and 
techniques by which the reflex may most 
efficiently be conditioned. As a result of
these studies, the conditioned reflex
technique has been established as an important
method of modifying certain types of
behavior.

Although many aspects of the conditioned 
reflex technique were considered before Pav- 
lov’s work, the theory of conditioned learn- 
ing was developed largely as the result of his 
studies. This theory (14) attempts to ex- 
plain all modifications of behavior on the 
basis of the conditioned reflex. Allport’s (1) 
theory of the development of social behavior, 
Watson’s (20) behaviorism, and Smith and 
Guthrie’s (16) work are typical of this 
concept. 

If the conditioned reflex theory of learning 
is applicable to all kinds of everyday learning, 
the conditioned reflex technique applied to a 
less specialized type of learning should yield 
valuable data concerning the learning process. 

A survey of the literature concerning the 
applicability of the conditioned reflex theory 
to all modifications of behavior indicates a 
scarcity of experimental studies which have 
definitely applied the conditioned reflex tech- 
nique to the less specialized types of learning. 
Studies such as those of Jones (11) and of 
Watson and Rayner (21) do, however, in a 
general way demonstrate the application of 
the conditioned reflex technique in modifying
emotional behavior. Thorndike (18), in his
investigation of what he calls “the principle
of belongingness,” seems very definitely to
have used a phase of the conditioned reflex 
technique. 

There also seems to be some uncertainty 
among students of the conditioned reflex phe- 
nomena as to whether all of the so-called 
conditioned reflexes are really true reflexes. 
Tolman (10) limits conditioned reflex
learning to the conditioning of the true reflex; i.e.,
the attaching of the original unconditioned
response to the conditioned stimulus. Other
aspects of the conditioning process, which are
less specialized than the true conditioned
reflex, have been called “sign learning” by
Tolman. “Sign learning,” according to Tolman,
consists of the everyday type of learning in
which one learns to respond to a stimulus in
a different way from that in which he first
responded to it. This interpretation seems to
be supported by Cason (5), who says there is
considerable doubt whether all the conditioned
reactions which have been secured
were true conditioned reflexes.

The Conditioned Reflex Technique 

In order to establish a conditioned response,
there must be a stimulus capable of eliciting 
a response (the unconditioned stimulus) and 
another stimulus to which the response is not 
attached (the conditioned stimulus), and 
these two stimuli must be presented in rather 
close temporal relationship. The conditioned 
response is established when the inappropri- 
ate stimulus is substituted for the uncondi- 
tioned stimulus. It is generally considered 
that in the everyday type of learning either 
side (the S or the R) of the response mech- 
anism may be conditioned. 

An important aspect of the conditioned re- 
flex technique is the method of presenting the 
stimulus relative to time relationship. The 
conditioned stimulus may be simultaneous
with, may precede, or may follow the uncon- 
ditioned stimulus. In this investigation the 
following names are used to distinguish the 
three time relationships: 

1. Simultaneous. The conditioned stimulus
and the unconditioned stimulus are
presented simultaneously.

2. Delayed. The conditioned stimulus
precedes the unconditioned. The two
do not overlap in time.

3. Backward. The conditioned stimulus
follows the unconditioned. The two do
not overlap in time.
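
The three time relationships can be expressed as a simple rule on the onset and offset of each stimulus. The sketch below is illustrative only: the function name, the use of seconds, and the fourth label for overlapping but non-simultaneous presentations (a case the classification above does not name) are assumptions.

```python
def presentation_method(cs_on: float, cs_off: float,
                        us_on: float, us_off: float) -> str:
    """Classify the timing of a conditioned stimulus (CS) relative to an
    unconditioned stimulus (US). Times are in seconds.

    Hypothetical helper; "unclassified" covers partial overlap, which the
    three named relationships do not include.
    """
    if cs_on == us_on and cs_off == us_off:
        return "simultaneous"
    if cs_off <= us_on:   # CS ends before the US begins
        return "delayed"
    if us_off <= cs_on:   # US ends before the CS begins
        return "backward"
    return "unclassified"


examples = [
    presentation_method(0, 5, 0, 5),
    presentation_method(0, 5, 25, 30),
    presentation_method(25, 30, 0, 5),
    presentation_method(0, 5, 3, 8),
]
print(examples)
```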
The conditioned reflex theory assumes that
all learning reduces to conditioned reflexes.
If the conditioned reflex technique, as
described, is applied to a simple, unspecialized
type of learning comparable with everyday
learning situations, as contrasted to the
highly specialized reflex, a number of
questions pertinent to learning efficiency are
raised:

1. Does the modification of less specialized
behavior by the conditioned reflex
technique follow the same laws as does the
conditioning of true reflexes?

2. Which of the three methods of presenting
the terms (simultaneous, delayed, or
backward) is most efficient for learning?

3. What effect does the length of the time
interval between the presentation of
terms have upon the efficiency of
learning?

4. Will learning efficiency be affected if the
time intervals are filled with interpolated
activities?

5. Will learning be facilitated by certain
kinds of interpolated activities and be
retarded by others?

6. By which method of presenting terms
(simultaneous, delayed, or backward)
will learning be most stable (remembered
best)?

7. Are modifications of less specialized
types of behavior by the conditioned
reflex technique unstable, and are these
modifications soon lost?

8. Will retention be affected by interpolated
activities?


The purpose of this investigation is to at- 
tempt to answer the above questions by 
applying the conditioned reflex technique to 
a simple type of learning. 

The operation of the conditioned reflex 
technique in modifying the true reflexes
seems to be relatively well established. 
Pavlov (14) found in working with the sali- 
vary reflex that the simultaneous method of 
presenting stimuli was most effective and that 
the length of the time interval between the 
unconditioned and the conditioned stimulus 
was a determining factor in the ease of
conditioning. He also discovered that if the
conditioned stimulus preceded the unconditioned,
conditioning was extremely difficult, and that
modifications of reflex behavior by
conditioning were unstable. The results of more
recent investigation of the conditioned reflex
(10) indicate that the results of conditioning
may be relatively stable. Wolfle (22),
investigating the reflex jerk of the finger, found
backward conditioning difficult. She also
found that the length of the time interval 
between terms was important in ease of 
conditioning. 

Froeberg (7), working with nonsense syl- 
lable pairs, found that the longer time inter- 
val and interpolated activities between terms 
reduced learning. Bergstrom’s (3) results in 
a somewhat similar study would indicate that 
the optimal interval between terms depends at 
least in part upon the type of material to be 
learned. Perhaps the most important of all 
studies dealing with the effect of time in 
learning is that of Snoddy (17). In his very 
elaborate study, Snoddy showed that time in 
relation to maturation (experience) must be 
considered as an important factor. 

The theory has been advanced that asso- 
ciation never takes place between terms un- 
less they are presented simultaneously. The 
proponents of this theory insist that where 
associations do take place between terms 
separated by a time interval, it is because of 
the fact that, when the second term is pre- 
sented, the verbal, mental, motor, or kinaes- 
thetic image of the first is reproduced; hence 
it reduces to simultaneity of presentation. 
The soundness of this theory seems not to 
have been definitely demonstrated by experi- 
mental investigations. 

One can find much relevant material con- 
cerning the problem of interpolated activities 
between learning and recall in studies dealing 
with the phenomena of retention. Studies 
which involved interpolated activities which 
affected learning and retention, “retroactive
inhibition,” and “perseveration” are rather
numerous. One of the earlier studies which
was definitely concerned with the problem of
retroactive inhibition and perseveration was
that of Müller and Pilzecker (13). Their
investigations used nonsense syllables as the
material to be learned and retained, with the
interpolated activities between learning and
recall being rest, memorizing a second list of
syllables, or memorizing pictures; they found
that the perseverative process was most
influential immediately after learning and then
gradually lost its effect. Heine (9) and De Camp
(6), investigating the same problems, secured
conflicting results. Heine’s conclusions were in
general agreement with those of Müller and
Pilzecker; but De Camp found little difference
in results when an interpolated activity
of multiplication was compared with a rest
period.

The most extensive study of the effect of 
the similarity of interpolated activities on the 
retention of the original materials is that of 
Robinson (13). A number of interpolated 
activities were inserted between learning and 
recall, varying from very similar tasks to 
those which were quite dissimilar. Robin- 
son's results showed that the highly similar 
interpolated tasks affected learning efficiency 
most and those which were least similar af- 
fected learning least. 


Learning Materials 

After considerable experimentation with 
different types of materials, 30 pictures of 
objects and people, approximately 7” x 10” 
in size, were selected as being the most sat- 
isfactory type of learning materials. The 
pictures were mounted on 9” x 12” drawing 
paper and each picture was given a two-word 
name. There was no connection, either in 
current use, or of the elements in the picture, 
between the name and the picture. By ran-
dom selection the 30 pictures were divided
into three groups of 10 each. The three
groups of pictures were labeled respectively 
simultaneous, delayed, and backward. Each 
picture was rotated through the three methods 
of presentation so that any differences in ease 
of learning would be equally distributed. 
These pictures were named the major pic- 
tures in order to distinguish them from the 
interpolated minor pictures. 

Five different time intervals (20, 30, 40, 
50, and 60 seconds) between the presentation 
of terms were studied; and five different ac- 
tivities (rest, arithmetic, pied type, names, 
and pictures) were interpolated during the
time intervals. The different learning situ-
ations are shown in Table I. The effect of
interpolated activities on learning efficiency
was studied for all the time intervals except
interpolated pictures during the 20-second
interval. The numbers in Table I refer to
the number of subjects for each time interval
in each learning situation.

The interpolated activities varied from
highly dissimilar tasks to tasks which were
quite similar to the major learning. The
arithmetic problems included addition, sub-
traction, multiplication, and division; the
pied type was single-spaced; each name con-
sisted of two words and was quite similar to,
although not the same as, the names of the
major pictures; the minor pictures were ap-
proximately one-half the size of the major
pictures and were mounted on drawing paper.
The minor pictures were given names consist-
ing of two words and the name was descrip-
tive of the picture.


Experimental Procedure 

The conditioned reflex technique was thor- 
oughly explained to the subject before the 
learning materials were presented. The sub- 
jects were definitely informed of just what 
they were expected to learn and how the effi- 
ciency of their learning would be tested. In 
order to be sure that the explanations were 
fully understood, materials similar to the 
learning materials were used to demonstrate 
each phase of the learning situation. The 
laboratory in which the experiment was con- 
ducted consisted of a normal, natural learning 
environment, and not more than seven sub- 
jects were involved at one time. 


At the beginning of each learning situation
the following additional information and di-
rections were given the subjects: “You will
be shown 30 pictures. Each picture will be
presented for a period of 10 seconds and its
name will be distinctly spoken twice. There
will be a 10-second interval between each
completed unit (picture-name) of the picture
series and the next unit. For 10 of these
pictures, the picture and the name will be
given at the same time; for 10 the name will
be given a unit of time preceding the picture;
and for 10 the name will be given a unit of
time after the picture has been shown. The
order in which the names and pictures are
presented, however, will be an entirely chance
order. Your task is to learn and to remem-
ber the correct name for each picture so that
when the pictures are again presented you can
give their correct names.”


TABLE I


LEARNING SITUATIONS—IMMEDIATE TEST GROUP


Length of
interval      (Rest)     Filled       Filled      Filled   Filled
in seconds    Unfilled   Arithmetic   Pied Type   Names    Pictures
20               23          14           21        20        -
30               27          21           14        20        14
40               19          26           37        16        12
50               24          21           20        15        30
60               40          24           34        14        14

Total           133         106          126        85        70


The method of testing learning efficiency
was the same for all time intervals in all
learning situations. As soon as the last of
the learning terms had been presented, sheets
numbered from 1 to 30 were handed to the
subjects. Exactly three minutes after
the last learning term had been presented the
pictures were shuffled and each picture was
presented for 10 seconds; but this time only
the number of the picture was given, and the
subjects were asked to write the name of the
picture following the correct number on their
sheets. Immediately following this test, if
the interpolated activities were minor pictures
or names, the subjects were tested on these
activities.¹ Learning efficiency was measured
on the basis of the number of correct re-
sponses. If both words in the name of the
major picture were given correctly, it was
scored two points; if one word of the name
was given correctly, it was scored one point.
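
The scoring rule just described can be sketched in a few lines. This is a minimal illustration, not the author's procedure: the function names, the example picture names, and the choice to compare words in order without regard to capitalization are all assumptions, since the report does not say how misspellings or reversed word order were treated.

```python
def score_name(response: str, correct: str) -> int:
    """Score one written name: 2 points if both words are right, 1 if one is, else 0.

    Assumption: words are compared position by position, ignoring case.
    """
    given = response.lower().split()
    target = correct.lower().split()
    # zip stops at the shorter list, so a missing word simply earns no point.
    matches = sum(1 for g, t in zip(given, target) if g == t)
    return min(matches, 2)


def total_score(responses, answer_key):
    """Sum the points over all pictures on one answer sheet."""
    return sum(score_name(r, k) for r, k in zip(responses, answer_key))


# Hypothetical two-word names, invented for illustration only.
key = ["silver harbor", "quiet meadow", "broken lantern"]
sheet = ["silver harbor", "quiet valley", ""]
points = total_score(sheet, key)  # 2 + 1 + 0
```

With 10 pictures per method of presentation, a subject's score for any one method can range from 0 to 20 under this rule, and the total over the three methods from 0 to 60.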


One group of subjects, the immediate test
group (G1), was given only the one test,
immediately following the learning period.
The other group, the re-test group (G2), was
given, in addition to the immediate test, four
memory tests at intervals of one week, one
week, one week, and two weeks. At no time
during this five-week period did the subjects
have any opportunity to see the major pictures
or to review the learning materials in any
way. Neither were the subjects of G2 in-
formed that they were to have any of the
memory tests. The memory tests were con-
ducted in the same manner as the immediate
test except that for each re-test the numbers
of the pictures were changed.


¹ The performance of the subjects on the interpolated
activities was analyzed, but the data have little bearing on
the problems of this study and are too voluminous to
be included.

The experimental procedure for presenting
the terms is illustrated in Figure 1. The dia-
gram shows the plan of presentation of the
major pictures and the interpolated arith-
metic problems. The plan shown is for the
20-second interval, but is representative of
the other time intervals in the series, except
that the interval between the words “work”
and “stop” would be 30, 40, 50, or 60 seconds.
If the term “rest”, “pied type”, “names”, or
“pictures” were substituted for the word
“problems”, this diagram would be represen-
tative of the other four learning situations.

During the interval between the commands 
“work” and “stop”, the subjects were in- 
structed as follows: (1) unfilled—relax as 
completely as possible; (2) arithmetic—work 
continuously on the problems, both speed and 
accuracy are important; (3) pied type—work 
steadily at checking the designated letter, both
speed and accuracy are important; (4) 
names—give close attention to the names 
read to you; you will be tested on the num- 
ber you remember; for the 20-second interval 
there will be one name, two for the 30, three 
for the 40, four for the 50, and five names for 
the 60-second interval; (5) pictures—give 
close attention to the minor pictures, since 
they will be presented again and you will 
be expected to remember their names; for 
the 30-second interval there will be one minor 
picture, two for the 40, three for the 50, and 
four during the 60-second interval. At the 
command “stop”, give your undivided atten- 
tion to the experimenter. 
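
The number of interpolated names or minor pictures given in each interval follows a simple rule, which may be sketched as below. The function name and the error handling are illustrative assumptions; only the counts themselves come from the instructions quoted above.

```python
def interpolated_item_count(activity: str, interval_seconds: int) -> int:
    """Number of names or minor pictures presented during one interval."""
    if interval_seconds not in (20, 30, 40, 50, 60):
        raise ValueError("intervals studied were 20, 30, 40, 50, and 60 seconds")
    if activity == "names":
        # One name for the 20-second interval, one more for each added 10 seconds.
        return (interval_seconds - 10) // 10
    if activity == "pictures":
        if interval_seconds == 20:
            raise ValueError("minor pictures were not used in the 20-second interval")
        # One minor picture for the 30-second interval, one more per added 10 seconds.
        return (interval_seconds - 20) // 10
    raise ValueError("counts are stated only for names and pictures")


names_per_interval = {s: interpolated_item_count("names", s) for s in (20, 30, 40, 50, 60)}
pictures_per_interval = {s: interpolated_item_count("pictures", s) for s in (30, 40, 50, 60)}
```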


BACKWARD
    Picture (10 seconds) | “Work” | Problems (20 seconds) | “Stop” | Name (10 seconds)

SIMULTANEOUS
    Interval (10 seconds) | Picture and Name (10 seconds) | Interval (10 seconds)

DELAYED
    Name (10 seconds) | “Work” | Problems (20 seconds) | “Stop” | Picture (10 seconds)


Figure 1.—Plan of presentation of major
pictures and interpolated activities.


Subjects 

The subjects were 608 college students.
There were 520 subjects in the immediate
test group, and 88 subjects in the re-test
group. The age, sex, and class standing of
the two groups are given in Tables II and III.
The subjects participating in each time in-
terval in each learning situation were unse-
lected. A survey of their percentile ratings,²
which was the only available measure of
their ability (intelligence), indicated that the
average ability of the different groups of sub-
jects was approximately equal. The subjects
in the immediate test groups, however, were
on the average more mature both in age and
class standing than were the subjects of the
re-test groups.


TABLE II


AGE, SEX, AND CLASS DISTRIBUTION OF SUBJECTS—G1


                 Age                            Sex           Class
16   17   18    19    20    21   22   23-28    M     F     F     S     Jr   Sr     N
 6   36   120   157   118   47   16    20      286   234   265   203   35   17     520


TABLE III


AGE, SEX, AND CLASS DISTRIBUTION OF SUBJECTS—G2


                 Age                      Sex         Class
17   18   19   20   21   22   23-40     M    F     F    S    Jr   Sr     N
10   17   25   16    7    3    10       50   38    67   11    7    3     88


RESULTS 


Immediate Test Group.—Modifications in 
learning efficiency resulting from the follow- 
ing three factors were studied in this phase 
of the investigation: (1) the method of pre- 
senting the terms, simultaneous, delayed or 
backward; (2) the length of the time inter- 
val between terms; and (3) the nature of the 
interpolated activities. 

(1) The Method of Presenting Terms.—
The average³ scores for each of the three
methods of presenting terms for the 520 sub-
jects of the immediate test group for the five
time intervals in the five learning situations
were computed. These data are given in
Table IV. The score made by the simultane-
ous method of presenting terms was 2.22
higher than that made by the delayed
method; the delayed was 1.90 higher than
the backward; and the simultaneous was 4.12
higher than the backward. It will be noted
that these differences between the average
scores are considerably higher than necessary
for complete statistical reliability.⁴


² Psychological Examination for High School Graduates and
College Freshmen, prepared by L. L. and T. G. Thurstone,
published by the American Council on Education.

³ Due to the voluminous character of the individual data,
only the average scores for each time interval in each learn-
ing situation are presented.


TABLE IV


THE AVERAGE SCORES FOR EACH OF THE THREE
TIME RELATIONSHIPS, S, D, AND B, MADE
BY THE 520 SUBJECTS—G1


            S         D         B         S
Ave.      13.10     10.88      8.98     13.10
Diff.          2.22      1.90      4.12
Rel.           6.9       5.43     12.87
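
The "Rel." entries in these tables are ratios of a difference between two averages to the standard error of that difference, with a ratio of 3 taken as practically complete reliability. A minimal sketch of that computation follows. The standard-error formula shown is the conventional one for two independent groups and is an assumption here; the report omits the intermediate steps, and the author may have used a different formula, since the S, D, and B scores come from the same subjects. The standard deviations in the example are hypothetical, as none are reported.

```python
import math


def sigma_diff(sd_a: float, n_a: int, sd_b: float, n_b: int) -> float:
    """Standard error of the difference between two independent means (assumed formula)."""
    return math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)


def critical_ratio(mean_a: float, mean_b: float, sigma_d: float) -> float:
    """Difference between two averages divided by its standard error."""
    return abs(mean_a - mean_b) / sigma_d


def is_reliable(ratio: float, threshold: float = 3.0) -> bool:
    """Apply the criterion of 3 stated for practically complete reliability."""
    return ratio >= threshold


# Hypothetical standard deviations and group sizes, for illustration only.
example_sigma = sigma_diff(3.0, 100, 4.0, 100)

# Table IV reports a difference of 2.22 with a ratio of 6.9 for S against D,
# which implies a standard error of roughly 2.22 / 6.9, or about 0.32.
implied_sigma = 2.22 / 6.9
ratio_s_vs_d = critical_ratio(13.10, 10.88, implied_sigma)
```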


⁴ The length of tables has been reduced by omission of
intermediate statistical steps. Reliability of the differences
between averages is in terms of D / sigma (diff.). A
D / sigma (diff.) of 3 indicates practically complete reliability.


A further comparison of the extent of
learning for each of the three methods is
presented in Figure 2. The solid bar repre-
sents the average score made by the simul-
taneous method of presenting terms; the lined
bar the delayed; and the open bar the back-
ward. This graph shows plainly that, when
the composite scores for the three methods of
presenting terms are considered, the simul-
taneous method is far more efficient than
the delayed or backward, and that the de-
layed method is superior to the backward.


Figure 2.—The average scores for each of
the three time relationships, S, D, and B, made
by the 520 subjects—G1.


(2) The length of the time interval be-
tween the presentation of terms.—Variations
in learning by the S, D, and B method of
presenting terms resulting from the length
of the time interval may be studied by a com-
parison of the S, D, and B scores for the
different time intervals in the unfilled learning
situation. These scores are given under the
caption “unfilled” in Table V. There is little
difference between the average S, D, and B
scores in the 20-second interval. The differ-
ences that do exist possess such low reliability
that they may be ignored. The average S
and D scores for the 30-second interval are
approximately equal but the B score is con-
siderably lower. The differences between the
average scores of D and B (3.37) and be-
tween B and S (3.52) are reliable differences.
The S and D scores are higher for the 30-
second interval than they were for the 20-
second, but the B score is lower.
The average S, D, and B scores are higher
for the 40-second interval than they were for
either the 30- or 20-second interval. There is
little difference in the S and D scores, but the
B score is 2.11 lower than D and 2.69 lower
than S. The S, D, and B scores are widely
separated for the 50-second interval and lower
than they were for the 40-second interval.
For the 60-second interval the S and D scores
are approximately equal, but the B score is
2.55 lower. The S score is lower for the
60- than it was for the 50-second interval
but the D and B scores are higher. The ef-
fect of the length of the time interval upon
learning efficiency by the S, D, and B method
of presenting terms may be readily seen if
the curves (A unfilled) in Figure 3 are
examined.

The total average scores (the sum of the
S, D, and B scores) for each time interval
are given under the caption “unfilled” in
Table VI. When these total average scores
for each time interval in the unfilled learning
situation are examined it is immediately
noted that the shortest time interval between
terms is not conducive to the most efficient
learning. The total average score for the 20-
second interval is 40.17, while for the 30-
second interval it has increased to 45.56.


TABLE V


THE AVERAGE SCORES MADE BY THE SIMULTANEOUS, DELAYED, AND BACKWARD METHODS OF
PRESENTING TERMS FOR EACH TIME INTERVAL IN EACH LEARNING SITUATION—G1


              20 Seconds             30 Seconds             40 Seconds             50 Seconds             60 Seconds
           Ave.  Diff.   Rel.     Ave.  Diff.   Rel.     Ave.  Diff.   Rel.     Ave.  Diff.   Rel.     Ave.  Diff.   Rel.
UNFILLED
  S       13.17                  16.41                  16.95                  15.63                  13.43
  D       13.43    .26    .18    16.26    .15    .19    16.37    .58    .48    11.71   3.92   2.90    14.33    .90    .95
  B       13.57    .14    .11    12.89   3.37   3.21    14.26   2.11   1.45     9.67   2.04   1.38    10.88   3.45   3.11
  S       13.17    .40    .10    16.41   3.52   3.06    16.95   2.69   1.33    15.63   5.96   4.19    13.43   2.55   2.50
NAMES
  S       11.85                  13.70                  15.06                  16.40                  16.43
  D        9.60   2.25   1.36    11.95   1.75    .88    12.06   3.00   2.31    15.47    .93    .65    11.86   4.57   3.01
  B        8.20   1.40    .82    11.85    .10    .05    10.62   1.43    .99    13.33   2.14   1.30    12.14    .28    .17
  S       11.85   3.65   2.05    13.70   1.85   1.15    15.06   4.43   3.10    16.40   3.07   2.26    16.43   4.29   2.80
PIED TYPE
  S       11.00                  13.86                  12.11                  15.75                  12.76
  D       10.48    .52    .89    13.79    .07    .05    11.68    .43    .43     8.85   6.90   5.89    12.38    .38    .38
  B        7.71   2.77   2.18    11.07   2.72   1.94     9.14   2.54   2.39     5.65   3.20   2.32     7.79   4.59   4.73
  S       11.00   3.29   2.91    13.86   2.79   1.99    12.11   2.97   2.91    15.75  10.10   8.94    12.76   4.97   4.89
PICTURES
  S                              13.21                  13.25                  13.93                  10.79
  D                              10.71   2.50   1.77     8.17   5.08   2.83     8.83   5.10   4.05     4.79   6.00   3.47
  B                               9.71   1.00    .76     8.92    .75    .85     7.73   1.10    .79     5.93   1.14    .69
  S                              13.21   3.50   2.40    13.25   4.33   2.38    13.93   6.20   5.49    10.79   4.86   2.68
ARITHMETIC
  S       10.57                   9.19                  10.08                   8.43                  10.38
  D       11.64   1.07    .02     7.95   1.24    .88     5.73   4.35   3.45     5.14   3.29   2.26     7.96   2.42   1.65
  B        7.29   4.35   3.30     4.67   3.28   2.65     3.08   2.65   2.26     3.38   1.76   1.43     6.08   1.88   1.48
  S       10.57   3.28   2.73     9.19   4.52   3.98    10.08   7.00   6.60     8.43   5.05   3.80    10.38   4.30   3.23

There is a slight additional increase up to
47.58 for the 40-second interval. For the
50-second interval the score drops to approxi-
mately the same as for the 20-second inter-
val and maintains the same relative position
although the trend is upward for the 60-
second interval. For the unfilled intervals,
the 30- and 40-second intervals lead to the
most efficient learning. A glance at the sta-
tistical reliability (Table VI) of the differ-
ences between the total average scores shows
that they possess high reliability. The
marked increase in learning efficiency with
increase in the length of the time interval is
shown by the curve (F-totals) marked “un-
filled” in Figure 3.
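
Since each total average score is defined as the sum of the S, D, and B averages, the totals quoted for the unfilled situation can be checked directly. The sketch below is an illustrative check only; the variable names are assumptions, and the values are the unfilled averages as read from Table V.

```python
# Average S, D, and B scores for the unfilled situation, keyed by the
# length of the interval in seconds (values as read from Table V).
UNFILLED = {
    20: (13.17, 13.43, 13.57),
    30: (16.41, 16.26, 12.89),
    40: (16.95, 16.37, 14.26),
    50: (15.63, 11.71, 9.67),
    60: (13.43, 14.33, 10.88),
}

# Total average score = S + D + B for each interval.
totals = {seconds: round(sum(scores), 2) for seconds, scores in UNFILLED.items()}

# Interval with the highest total average score.
best_interval = max(totals, key=totals.get)
```

The sums for the 50- and 60-second intervals come out one hundredth above the published 37.00 and 38.63, which is consistent with the published totals having been computed from unrounded averages.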
Time intervals of different lengths between
the presentation of terms operated in two dis-
tinct ways to produce variations in learning:
(1) changes in the average scores made by
the S, D, and B technique; and (2) increases
in the length of the time interval through the
40-second interval increased learning.

(3) Interpolated activities.—Variations in
learning efficiency by the S, D, and B tech-
nique resulting from the interpolated activi-
ties may be studied by comparing the S, D,
and B scores for each time interval in the
filled learning situations with those of the
unfilled.

In all but three instances, the 20- and 60-
second intervals in the unfilled learning situa-
tion and the 20-second interval in the inter-
polated arithmetic problems, learning was
more efficient by the simultaneous method
than it was by the delayed or backward. In
only four cases are the scores higher by the
backward method than by the delayed: the
20-second unfilled, 60-second interpolated
names, and the 40- and 60-second interpo-
lated pictures. Although these average
scores are slightly higher, the statistical re-
liability of the differences (Table V) is only
slightly better than chance. There is one
instance, the 20-second unfilled, where the aver-
age score was higher by the backward method
than by the simultaneous. The difference
(Table V) is hardly more than a chance dif-
ference. When the average scores made by
the simultaneous method are compared with
those made by the backward method, the
higher average scores are registered by the
simultaneous method, except in the case men-
tioned, and in general the differences between
the average scores made by the simultaneous
and backward methods possess consistently
high statistical reliability.


Figure 3.—The efficiency of learning by the S, D, and B methods
of presenting terms.


TABLE VI

THE TOTAL AVERAGE SCORES FOR EACH TIME INTERVAL IN EACH LEARNING SITUATION—G1

              20 Sec.   30 Sec.   40 Sec.   50 Sec.   60 Sec.   20 Sec.
UNFILLED
  Ave.         40.17     45.56     47.58     37.00     38.63     40.17
  Diff.             4.39      2.02     10.58      1.63      1.41
  Rel.              1.38       .58      3.07       .51       .43
NAMES
  Ave.         29.65     37.50     37.75     45.20     40.43     29.65
  Diff.             7.85       .25      7.45      4.77     10.78
  Rel.              1.58       .06      2.09      1.22      2.45
PIED TYPE
  Ave.         29.19     38.71     32.92     30.25     32.94     29.19
  Diff.             9.52      5.79      2.67      2.69      3.75
  Rel.              2.63      1.70       .95       .95      1.21
PICTURES
  Ave.                   33.64     30.33     30.50     21.50
  Diff.                       3.31       .17      9.00
  Rel.                         .80       .05      2.16
ARITHMETIC
  Ave.         29.50     21.81     18.88     16.95     24.42     29.50
  Diff.             7.69      2.93      1.93      7.42      5.08
  Rel.              2.66      1.00       .57      2.06      1.61

A comparison of the S, D, and B curves for 
the filled learning situations with those of the 
unfilled (Figure 3) shows that, although the 
interpolated activities did produce variations 
in the S, D, and B scores, the variations in 
the four filled learning situations were neither 
uniform nor constant. The efficiency of the 


S, D, and B technique was markedly changed 
by some of the interpolated activities, while 
other activities produced relatively small 
changes. 

Changes in learning efficiency during each 
time interval resulting from the interpolated 
activities may be more clearly observed by 
comparing the total average scores (Table 
VI) for each time interval in the filled learn- 
ing situations with those in the unfilled. The 
general trend of the scores in the interpolated 
name learning situation is similar to that in 
the unfilled. 

In interpolated names, however, learning 
was most efficient during the 50-second in- 
terval; and, even for the 60-second interval, 
the score was well above the 30- and 40-
second interval performance and far above
the 20-second interval. The differences be- 
tween these averages possess high statistical 
reliability. 

In the interpolated pied type learning situ- 
ation the general trend of the data has 
changed. In this case the 30-second interval 
was conducive to the most efficient learning, 
the scores on the other four intervals being 
rather close together. More changes are
noted in the trend of the scores in interpo-
lated pictures. In this learning situation the
shortest time interval studied, the 30-second,
produced the most efficient learning. There was
a slight drop in the score from the 30- to the
40-second; the 50-second remained practi-
cally the same; but a considerable drop in
score took place from the 50- to 60-second 
interval. The reliability of these differences 
is low, except the difference between the 
scores for the 50- and 60-second interval. 

Very marked changes are noted in the 
scores for interpolated arithmetic. Here the 
shortest, 20-second, and the longest, 60- 
second, intervals produced the highest scores. 
The total average scores for the intermediate
time intervals, 30, 40, 50 seconds, are con- 
siderably lower than for the short and long 
intervals. 

Comparing the scores for the different time 
intervals in each learning situation, it is clear 
that the interpolated activities have produced 
a complete reversal in the trend of the scores 
in relation to the length of the time interval. 
This condition may be seen by examining the
curves (F-totals) in Figure 3. The plotted
scores for the unfilled time intervals give an
almost perfect bell-shaped curve, showing the
highest scores for the middle time intervals
and the lowest scores for the short and long
intervals. The curves for interpolated names
and pied type are still bell-shaped, but not
as regular as for the unfilled scores. For in-
terpolated pictures the curve has attained a
new shape. The curve for interpolated arith-
metic is again bell-shaped but completely in-
verted. From the unfilled scores through the
interpolated activities the curves change from
an upright bell-shape to an inverted bell-
shape.

The data in Table VI have been reorgan-
ized in Table VII so that the total average
scores made in each time interval in the filled
learning situations may be compared directly
with the scores in the unfilled. Learning was
most efficient for the unfilled learning situa-
tion. When activities were interpolated,
names had the least deteriorative effect, fol-
lowed by an increasing reduction in total
average scores through pied type, pictures,
and arithmetic.
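
The ordering of the learning situations can be illustrated by averaging the total average scores of Table VI over the intervals studied. This is a rough sketch under a stated assumption: the means below are simple unweighted means over intervals, whereas the author's composite may have been weighted by the number of subjects in each interval. The variable and function names are illustrative.

```python
# Total average scores from Table VI (G1), listed by interval from the
# shortest to the longest studied. Pictures has no 20-second entry
# because that condition was not studied.
TOTALS = {
    "unfilled":   [40.17, 45.56, 47.58, 37.00, 38.63],
    "names":      [29.65, 37.50, 37.75, 45.20, 40.43],
    "pied type":  [29.19, 38.71, 32.92, 30.25, 32.94],
    "pictures":   [33.64, 30.33, 30.50, 21.50],
    "arithmetic": [29.50, 21.81, 18.88, 16.95, 24.42],
}


def mean_total(scores):
    """Unweighted mean over the intervals studied (an assumption, see above)."""
    return sum(scores) / len(scores)


means = {name: round(mean_total(scores), 2) for name, scores in TOTALS.items()}

# Situations ordered from most to least efficient learning.
ranking = sorted(means, key=means.get, reverse=True)
```

Under this assumption the ordering agrees with the one stated in the text: unfilled first, then names, pied type, pictures, and arithmetic.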

The total average score for the 20-second
unfilled interval is considerably higher than
that for names. There is little difference,
however, in the scores for names, pied type,
and arithmetic. The differences of 10.52 be-
tween the scores for unfilled and names, and
of 10.67 between unfilled and arithmetic,
possess high reliability.

For the 30-second interval the difference
between the total average scores for unfilled
and names is 8.06, which is near complete
statistical reliability. The total average
scores for names and pied type are about the
same, but for interpolated pictures there is a
drop in score of 5.07 below the score for pied
type. Interpolated arithmetic caused another
drop of 11.83 below the score on pictures.
The differences between the average scores
for pictures and arithmetic, and arithmetic
and unfilled, are reliable statistical differ-
ences.


TABLE VII


THE RELIABILITY OF THE DIFFERENCES BETWEEN THE TOTAL AVERAGE SCORES FOR EACH
TIME INTERVAL IN EACH LEARNING SITUATION


              Unfilled    Names    Pied type   Pictures   Arithmetic   Unfilled
20 SECONDS
  Ave.         40.17      29.65      29.19        -          29.50       40.17
  Diff.             10.52       .46         -           .31        10.67
  Rel.                ..         ..         -            ..          ..
30 SECONDS
  Ave.         45.56      37.50      38.71       33.64       21.81       45.56
  Diff.              8.06      1.21       5.07        11.83        23.75
  Rel.                ..         ..         ..           ..          ..
40 SECONDS
  Ave.         47.58      37.75      32.92       30.33       18.88       47.58
  Diff.              9.83      4.83       2.59        11.45        28.70
  Rel.                ..         ..         ..           ..          ..
50 SECONDS
  Ave.         37.00      45.20      30.25       30.50       16.95       37.00
  Diff.              8.20     14.95        .25        13.55        20.05
  Rel.                ..         ..         ..           ..          ..
60 SECONDS
  Ave.         38.63      40.43      32.94       21.50       24.42       38.63
  Diff.              1.80      7.49      11.44         2.92        14.21
  Rel.                ..         ..         ..           ..          ..


For the 40-second interval the score for the
unfilled learning situation is 9.83 higher than
the score for interpolated names. This dif-
ference has high statistical reliability. There
is also a drop of 4.83 between the scores of
names and pied type. There is only a slight
drop in score from pied type to pictures, but
there is a difference of 11.45 between the
scores of pictures and arithmetic. Between
the unfilled and the interpolated arithmetic
scores there is a difference of 28.70. This
difference has full statistical reliability.

The total average score for interpolated 
names in the 50-second interval is 8.20 higher 
than the total average score for the unfilled 
interval. It will be observed that the differ- 
ence between the scores for unfilled and in- 
terpolated names has high statistical reliabil- 
ity. The total average score for interpolated 
pied type is 14.95 lower than for names. This 
difference is completely reliable. There is 
very little difference between the scores for 
pied type and pictures, but for interpolated 
arithmetic the score is 13.55 lower than the 
interpolated picture score. This difference is 
completely reliable, as is the difference of 
20.05 between the scores on interpolated rest 
and arithmetic. The score for the 60-second 
interval in interpolated names is 1.80 better 
than the unfilled learning situation. This 
difference is only slightly better than chance. 
The difference of 7.49 between the scores on 
interpolated names and pied type possesses 
high statistical reliability. There is a differ-
ence of 11.44 between the scores on inter-
polated pied type and pictures, and this dif-
ference also possesses high statistical reliabil-
ity. There is a difference of only 2.92 be-
tween the score for interpolated pictures and
arithmetic, but there is a difference of 14.21 
between the scores on interpolated arithmetic 
and unfilled which is a reliable statistical 
difference. 

The reduction in learning efficiency result- 
ing from the different interpolated activities 
can be seen in the downward slope of the 
curve in Figure 4 from unfilled through the 
other interpolated activities down to arith- 
metic. The curve is plotted from the average 
score made in each learning situation. No 
sudden or extreme drops appear in the curve, 
but rather a slow, gradual decline. 

This is an interesting phase of the study: 
other investigations have indicated that with 
interpolated activities learning would be re- 
duced in proportion to the similarity of the 
interpolated activities to the major learning 
materials. Apparently this is not true in this 
study, since names and minor pictures are 
both quite similar to the major pictures, and 
pied type and arithmetic are quite dissimilar, 
yet learning was reduced least by names and 
most by arithmetic. This would indicate 
that the amount of concentration (attention) 
demanded by the interpolated activities was 
the dominant factor, rather than the simi- 
larity or lack of similarity of the interpolated 
activities to the major learning materials. 

The interpolation of activities between the 
presentation of terms affected learning effi- 
ciency in three major respects: (1) produced 
changes in the relationship of the S, D, and 
B scores; (2) caused variations in learning 
in the different time intervals; and (3) caused 
a reduction in learning. 
Figure 4.—The effect of interpolated activities on
learning efficiency.


Re-test Group

The 88 subjects in the re-test group were 
given, in addition to the test immediately fol- 
lowing the presentation of the major pictures, 
four subsequent tests at intervals of one
week between each of the first four tests and
two weeks preceding the last test, thus dis-
tributing the tests over a period of five weeks.
The experimental procedures for the re-test 
groups were identical with those of the im- 
mediate test groups, except that only one time 
interval, the 30-second, was studied. 
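
The timing of the five tests follows from the stated gaps of one, one, one, and two weeks. A small sketch, with illustrative variable names, makes the schedule explicit:

```python
from itertools import accumulate

# Gaps in weeks between successive tests, as stated for the re-test group.
GAPS_IN_WEEKS = [1, 1, 1, 2]

# Test 1 immediately follows learning (week 0); later tests fall at the running sum.
weeks_after_learning = [0] + list(accumulate(GAPS_IN_WEEKS))

# Map each test number (1 through 5) to its delay in weeks.
schedule = dict(enumerate(weeks_after_learning, start=1))
```

On this schedule test 4 falls three weeks after learning and test 5 five weeks after learning.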

The important problems involved in the 
study of the average scores made by the re- 
test group of subjects are: (1) By which 
method of presenting terms, S, D, or B, is 
learning retained best? (2) the variations in 
retention for the different tests; (3) varia-
tions in retention in relation to the interpo-
lated activities; (4) a comparison of the effi-
ciency of learning of the re-test group with
that of the immediate test group.

(1) By which method of presenting terms,
S, D, or B, is learning retained best? The
average scores made by the S, D, and B
methods for each learning situation in each
test are given in Table VIII. Variations in
retention by the S, D, and B method of pre-
senting terms are somewhat more pronounced
in interpolated arithmetic, pictures, and
names than in the unfilled and pied type
learning situations. The curves in Figure 5
show no marked variations in degree of re-
tention by the three methods of presenting
terms.

The curves in Figure 5 are plotted from
the data given under averages (the average
S, D, and B scores for each test) at the bot-
tom of Table VIII. These curves show that,
although there are slight variations in reten-
tion from one test to another, retention in re-
lation to the method of presenting terms is
relatively constant. More is learned by the
simultaneous method than by the delayed or
backward, and more by the delayed than by
the backward; but the degree of retention for
each method is approximately the same.


Figure 5.—The total average scores made on each test by the
S, D, and B method of presenting terms—G2.

(2) The variations in retention for the
different tests.—If learning by the condi-
tioned reflex technique is unstable and soon
lost, this condition will be apparent when the
total average scores for each test are com-
pared. These data are given in Table IX.
The highest total average scores are regis-
tered on test 1, the test immediately follow-
ing the presentation of terms. It will be
noted that there is a considerable drop in the
total average scores from test 1 to test 2 and
that the statistical reliability of the differ-
ences between these averages is high. Reten-
tion is approximately the same for tests 2
and 3. For each of the five learning situa-
tions the total average scores are higher on
test 4 than they were on test 3. The increase
in retention on test 4 over tests 2 and 3 is
shown by the rise in each of the five curves
representing the five learning situations in
Figure 6. This consistent increase in the
total average scores on test 4 over test 3 is
considerably higher than a chance increase,
as is shown by the reliability of the differ-
ences between the average scores. Retention
is slightly less efficient on test 5 (for inter-
polated pied type it is slightly better) than
TABLE VIII


THE AVERAGE SCORES MADE BY THE SIMULTANEOUS, DELAYED, AND BACKWARD METHODS OF
PRESENTING TERMS FOR EACH LEARNING SITUATION IN EACH TEST—G2


Test 2 Test 3 Test 4 


Diff. Ave. Diff. Ave. Diff. Rel. 
TABLE IX


THE TOTAL AVERAGE SCORES MADE ON EACH TEST IN EACH LEARNING SITUATION—G2


T2 T3 
UNFILLED 
Ave. 43. 34. 54 
Diff. 
Rel. 


PIED TYPE 
Ave. 
Diff. 
Rel. 


ARITHMETIC
Ave. 
Diff. 
Rel. 


PICTURES 
Ave. 
Diff. 
Rel. 


NAMES 
Ave. 

Diff. 

Rel. 
AVERAGES 
on test 4, but the reliability of these differ-
ences between the total average scores is only 
slightly better than chance. 

A comparison of the total average scores
for each learning situation for each test (bot-
tom of Table IX) shows the gain in retention
on test 4 over tests 2, 3, and 5. The
difference between the total
average scores made on tests 1 and 2 has
complete statistical reliability. The reliabil-
ity of the difference between the scores of
tests 2 and 3 is low. The gain in total aver-
age score on test 4 over test 3 is sufficient to
show a reliability greater than chance. The
difference between the total average scores
for tests 4 and 5 possesses only slightly
greater than chance reliability.

The fact that retention was better on test 
4, three weeks after learning, than it was on 
test 2 or 3, one and two weeks respectively 
after learning, may represent what Ballard (2) 
called “reminiscence”. If these data do illus- 
trate the phenomenon of reminiscence, they 
do not support the hypothesis that reminis- 
cence is characteristic of the memory of chil- 
dren for poetry. The subjects in this study 
were not children and the learning materials 
were not poetry; and the experimental condi- 
tions were quite different from those of 
Ballard’s. 

These data show the importance of time in
relation to efficiency of retention. If materials
of the type learned in this study cannot
be tested immediately after the learning
period, then the opportune time for testing is
three weeks after the learning period.

(3) Variations in retention in relation to
the interpolated activities.—A comparison of
the scores for each test of the filled learning
situations with those of the unfilled (Table
IX) shows that, although less is learned in
the filled than in the unfilled learning situation,
the amount retained for each test in
the filled learning situations varies little from
that retained from what was learned in the
unfilled. The constancy of retention, regardless
of the nature of the interpolated activities,
may be readily seen by examining the
curves in Figure 6, the curves plotted from
the data in Table IX.

These curves show that learning in relation
to the interpolated activities is most efficient
in the following order: unfilled, pied type,
arithmetic, pictures, and names. The distance
between these curves shows the relative efficiency
of learning and retention for each test.
It will be noted that the curve representing
the total average scores made on the unfilled
learning situation is well above that of the
pied type; that the curves for pied type,
arithmetic, and pictures are relatively close
together; and that the curve for names is
well below that of pictures. These curves,
as well as the data in Table IX, show that
retention is constant regardless of whether
the intervals between terms are filled or unfilled.
Neither does the nature of the interpolated
activities affect the efficiency of
retention.

Figure 6.—The total average scores made on each test
in each learning situation.

TABLE X

A COMPARISON OF THE TOTAL AVERAGE SCORES MADE BY THE IMMEDIATE AND
RE-TEST GROUPS FOR EACH LEARNING SITUATION

        Unfilled        Pied type       Names           Pictures        Arithmetic
        G1      G2      G1      G2      G1      G2      G1      G2      G1      G2
Ave.    45.56   43.65   38.71   19.70   37.50   19.70   33.64   23.58   21.81   30.12
Diff.       1.91            17.80           17.80           10.06            8.31
Rel.         .42             4.62            4.52            2.56            2.52

(4) A comparison of the efficiency of
learning of the re-test group with that of the
immediate test group.—Because of differences
in number of tests and length of time inter- 
vals between terms, possibilities for compar- 
ison between the two groups are limited. The 
immediate test group had only the one test, 
immediately following the learning situation; 
and the efficiency of learning of the re-test 
group was studied at only one time interval, 
the 30-second. Therefore, comparisons be- 
tween the two are on the basis of the 30- 
second interval in the test immediately fol- 
lowing the learning situation. The compar- 
isons are further limited to a consideration of 
the total average scores (the sum of the aver- 
age scores on the simultaneous, delayed, and 
backward method) in each learning situation. 







The efficiency of learning for the different
learning situations for the immediate test
group (G1) was interpolated rest first, fol-
lowed by names, pied type, pictures, and
arithmetic. For the re-test group (G2)
learning was most efficient for interpolated
rest, followed by pied type, arithmetic, pic-
tures, and names. The total average scores
of G1 and G2 are given in Table X. The G1
scores are higher in all learning situations ex- 
cept interpolated arithmetic. The reliability 
of the difference between the total average 
scores in the unfilled learning situation is low, 
but in the other four learning situations the 
reliabilities of the differences are high. 

The curves in Figure 7, plotted from the
data in Table X, are close together in interpolated
rest, showing G1 slightly superior to
G2.

The superiority of learning of G1 over G2
increases through pied type to names, where
the distance between the two curves is greatest.
Differences in extent of learning between
the two groups are reduced in pictures;
and for arithmetic the curves have crossed,
G2 being superior to G1.

Figure 7.—The total average scores made by the immediate and
re-test groups for each learning situation.

The marked superiority of learning of G1
over G2 in pied type, names, and pictures,
and approximately as great a superiority of
G2 over G1 in arithmetic, is probably additional
evidence that reduction in learning resulting
from interpolated activities is dependent
upon the degree of concentration of the
subjects in the activities and not upon the 
nature of the activities. 


SUMMARY 

To investigate the efficiency of learning a
less specialized type of behavior by the con-
ditioned reflex technique, 608 subjects, 520 of
whom were tested only immediately follow- 
ing the learning period and 88 of whom in 
addition to the immediate test were given 
four additional tests distributed over a period 
of five weeks, were tested on their ability to 
learn and remember the names of 30 pictures. 
Five different time intervals, 20, 30, 40, 50, 
and 60 seconds, between the presentation of 
terms were studied; and the effect on learn- 
ing efficiency of interpolated rest, working 
arithmetic problems, checking letters in pied 
type, memorizing names, and learning the 
names of other pictures, at each time interval, 
was studied. Some of the more important 
results of the study are as follows: 

1. There are as many variations from the 
laws established for the conditioning of the 
true reflexes as there are similarities; there- 
fore, it would seem that the type of learning 
studied in this investigation does not follow 
the laws of conditioning of the true reflex. 

2. Of the three methods of presenting
terms (simultaneous, delayed, and back- 
ward), simultaneous provides the most effi- 
cient learning; the backward method is least 
efficient. 

3. Time intervals of different lengths pro-
duced changes in the efficiency of learning by
the simultaneous, delayed, and backward 
methods of presenting terms. 

4. Increasing the length of the time inter- 
val (unfilled) between terms increased learn- 
ing efficiency, the most efficient being the 
yo-second interval. 

5. Learning efficiency was reduced when
the interval between the presentation of
terms was filled with activities.

6. Interpolated activities between terms
reduced learning efficiency in proportion to
the amount of attention (concentration) demanded.
The similarity or lack of similarity
of the interpolated activities to the
learning materials seemed not to be important.

7. Changes in the relationship of the efficiency
of the simultaneous, delayed, and
backward methods of presenting terms and
variations in learning efficiency at the different
time intervals resulted from the interpolated
activities.

8. Retention was not affected by the
method (simultaneous, delayed, and backward)
of presenting terms.

9. Retention was not affected by the interpolated
activities.

10. Learning was stable with no tendency
for the new responses to fade away.

VARIATIONS OF NORMAL CHILDREN*

The purpose of this study was to determine
as nearly as possible to what extent
ten-year-old children of normal intelligence 
differ among themselves and within them- 
selves in such traits as physical health, socio- 
economic status, mechanical ability, musical 
ability, school achievement, and behavior 
tendencies. The problem was first suggested 
because of the apparent lack of sufficient 
accurate knowledge about this kind of child. 
Because the writer was of the opinion that 
educational workers could use more data in 
this area, he set out to investigate along a 
line that appeared to have been somewhat 
neglected. 

It is very important for the organization 
of the schools that superintendents, princi- 
pals, and supervisors know whether or not 
the large masses of pupils can be grouped for 
effective work. If, as is contended, there is 
no average child, if every pupil is a law unto 
himself, then there should be more schools 
like Winnetka. If, on the other hand, some 
satisfactory principle of homogeneous group- 
ing can be found, then much time, energy, 
and expense may be saved by such grouping. 

An examination of 1,312 titles listed in the 
bibliographies of the Review of Educational 
Research showed the following numbers of 
articles devoted to children of the three 
intellectual levels: 

20 articles dealing with superior children 

38 articles dealing with dull children 

11 articles dealing with normal children. 


A canvass of the Psychological Abstracts 
from the beginning of the publication in 
1927, up to and including January, 1937, dis- 
closed the following numbers of articles deal- 
ing with some phase of variations of other 
traits in relation to intelligence: 


71 articles dealing with superior children 
99 articles dealing with dull children 
27 articles dealing with normal children. 


* This article is a condensation of a doctor's dissertation 
(1937) by the author, a complete copy of which is on file in 
the office of Dr. D. A. Worcester, Chairman of the Depart- 
ment of Educational Psychology and Measurements, at the 
University of Nebraska. The original manuscript contains 
thirty-three tables and 129 pages of profiles and graphs. Anyone
interested in examining the data in detail may consult
the manuscript or write the author. 


The profiles printed in Paul R. Mort's
book The Individual Pupil¹ showed these
groups:

24 pupils with intelligence quotients above 110
23 pupils with intelligence quotients below 90
20 pupils with intelligence quotients between 90 and 110.


Of 1,543 titles of “Doctors' Theses Under
Way in Education” listed in the Journal of
Educational Research for the years 1934-1937,
only two studies devoted to the normal
child are listed, one of these studies being
the present one. A careful examination of
the studies found in all the sources investigated
discloses the fact that in only three of
them has “normality” been based primarily
upon intelligence quotients between 90 and
110. None of the studies bears directly
upon the differences of ten-year-old children
of normal intelligence. From the above consideration
it seems clear that:

1. In the past, superior children and dull
children have proved relatively more inter- 
esting than normal children for those making 
scientific studies, since almost two and one 
half times as many studies are reported re- 
ferring to superior children, and more than 
three and one half times as many studies of 
dull children as those referring to normal 
children. 

2. Few studies of strictly normal children 
are reported. Being just “normal,” they have 
evidently attracted little attention. 

3. There has not been any increased in- 
terest during the past four years in the nor- 
mal child. It appears, therefore, that people 
working in the field of research in education 
have neglected the child of normal intelli- 
gence; and theoretically, one half of the chil- 
dren of the country are in this neglected 
group. In short, there is a broad field and 
an apparent need for further study of chil- 
dren of normal intelligence. 
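
The ratios quoted in point 1 can be checked from the counts listed earlier. The sketch below assumes the ratios were formed by pooling only the Review of Educational Research and Psychological Abstracts counts (the Mort profiles and the theses list are left out); the variable and function names are illustrative and do not come from the article.

```python
# Article counts as reported for the two bibliographic surveys.
review_of_educational_research = {"superior": 20, "dull": 38, "normal": 11}
psychological_abstracts = {"superior": 71, "dull": 99, "normal": 27}


def combined_counts(*sources):
    """Add up the article counts for each intellectual level across sources."""
    totals = {}
    for source in sources:
        for level, count in source.items():
            totals[level] = totals.get(level, 0) + count
    return totals


totals = combined_counts(review_of_educational_research, psychological_abstracts)

# Ratio of studies of superior and of dull children to studies of normal children.
ratio_superior = totals["superior"] / totals["normal"]
ratio_dull = totals["dull"] / totals["normal"]

print(totals)
print(f"superior : normal = {ratio_superior:.2f}")
print(f"dull : normal     = {ratio_dull:.2f}")
```

Under that pooling assumption the ratios come out a little under two and one half for superior children and a little over three and one half for dull children, which agrees with the wording of point 1.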

For the purpose of this study, ten-year-old
children whose intelligence quotients lay between
90 and 110 were selected. The Haggerty
Intelligence Examination, Delta 2, for
grades 3-9 and the Otis Self-Administering
Tests of Mental Ability for grades 4-8 were
given to all ten-year-old pupils in the Public
Schools of Kearney, Nebraska, and in the
A. O. Thomas School, the latter being the
training school of the Nebraska State Teachers
College at Kearney. These pupils had
previously been checked in the Bureau of
Vital Statistics at Lincoln, Nebraska, for age.
The Stanford-Binet Individual Examination
(1916) was then given to twenty of those
pupils whose I.Q.'s were shown by the two
group tests to be very far apart, or whose
I.Q.'s the group tests showed to be near 90
or 110. Only those were chosen for the study
whose mean I.Q.'s were between 90 and 110.
This procedure made available a group of
fifty-eight ten-year-old pupils of normal I.Q.
(median = 98.8), consisting of thirty boys
and twenty-eight girls who were used as the
subjects for this study.

¹ Paul R. Mort, The Individual Pupil. New York:
American Book Company, 1928.


HEALTH AND PHYSICAL CONDITION 


The “Health Record” and the “Personal 
History—Contagious Diseases’ record for 
each of these pupils were examined. These 
data were obtained from the files of the 
trained nurse of the Kearney City Schools 
and from the office of the Director of Teacher 
Training at the State Teachers College. The 
health records included data relative to 
height, weight, and defects in eyes, ears,
teeth, throat, nose, glands, skin, posture,
feet, and nutrition. Table I shows in tabular
form portions of the data relative to these
traits. The totals at the bottom of the table
indicate the general trends of the different
items included in the table. One notes that
there are eighteen cases of severe defects as
shown by the mark xxx. For example, one
pupil, AP, has four severe defects, in throat,
nose, posture, and nutrition; and three slight
defects in teeth, skin, and feet. Pupil CA is
the only one with no defects marked against
him. He has, however, had his tonsils
removed.

TABLE I

HEALTH RECORDS OF SIX AND SUMMARY FOR FIFTY-EIGHT TEN-YEAR-OLD
CHILDREN OF NORMAL INTELLIGENCE

Pupil Ht. Wt. Eyes Ears Teeth

) 7 54 82-70* v v v
* “page 49 50-56 x V x
AP 2 _ 687 81-82 Vv Vv x
AQ 54 80-70 v / .
CA... ; 4615 47-50 V v P
53 84-67 x Vv xx

. 66.
. 66.
‘ 8.
S.D 2.83 10.
, , 5.37 16.61
ee ° 44.00

Code: /—normal, x—slight defect, xx—moderate defect, xxx—severe defect, 00—removed.


The heights and weights are shown in the
second and third columns of the table. The
heights vary from forty-six and one half
inches to fifty-nine inches, and the weights
from forty-five pounds to eighty-nine pounds.
Pupil AA is fifty-four inches tall, weighs
eighty-two pounds, and the standard for his
height at age ten years is seventy pounds.
He is, therefore, twelve pounds overweight.
Pupil AC is forty-nine and one half inches
tall, weighs fifty-five pounds, and is at the
correct weight for his age and height. As
would be expected, there is a greater variation
in weight than in height. A range of
twelve and one half inches in a group of ten-year-olds
whose mean height is fifty-three
inches, and a variation of forty-four pounds
in the same group whose mean weight is
sixty-six pounds, indicate that the members
of this group are far from uniform in height
and weight.

TABLE I (continued)

Throat Nose Glands Skin Posture Feet Nutrition

00 00 ’ ‘ x v Vv
00 v xx \ x v Vv
XXX Xxx v x XXX x xXx
y /
Xxx Xxx v \ x ‘

* The last number is the standard weight for the height; e.g., AA is 54 inches tall, weighs 82 pounds, and the standard weight for 54
inches at age 10 is 70 pounds.

TABLE II

PERSONAL HISTORY: CONTAGIOUS DISEASES AND VACCINATION OF SIX, AND TOTALS FOR
FIFTY-EIGHT TEN-YEAR-OLD CHILDREN OF NORMAL INTELLIGENCE*

                          Total
Chicken pox                 34
Measles                     26
Whooping cough              29
Mumps                       23
Scabies                      1
Scarlet fever                4
Small-pox                    7
Diphtheria                   1
Pneumonia                    7
Eczema                       3
Small-pox vaccination       11
Diphtheria vaccination      43

* It is of interest to note that there have been only eleven of the children vaccinated for small-pox, and there are seven cases of the
disease; whereas, there have been forty-three vaccinated for diphtheria with only one case of the disease.

Table II shows the pupils' personal history
relative to contagious diseases, and vaccination
for small-pox and diphtheria. Over one
half of the pupils have had chicken-pox and
whooping cough; nearly one half have had
measles and mumps; one pupil has had
scabies; four have had scarlet fever; and
seven have had small-pox. There have been
forty-three cases of vaccination for diphtheria
and eleven for small-pox. AP, the most defective
(physically) boy, has had no diseases
and has not been vaccinated, whereas the
most nearly normal (physically) boy has had
no diseases and has been vaccinated for
diphtheria.

Consideration of the data gathered relative
to the health and physical condition of these
fifty-eight ten-year-old children of normal intelligence
discloses that in this group: (1)
There is approximately three times as much
variation in weight as in height; (2) there is
a positive correlation of .822 ± .029 between
height and weight; (3) there is no significant
relationship between either height or weight
and number of physical defects; and (4) the
fact that a child is normal in intelligence
gives little or no clue as to what his health
or physical condition may be.
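
The figure attached to the height and weight correlation (.822 with .029 after it) reads like a probable error, which was the usual way of reporting the uncertainty of r at the time. The article does not state the formula, so the sketch below is an assumption: it uses the classical expression 0.6745(1 − r²)/√N with N = 58, and the function name is illustrative.

```python
import math


def probable_error_of_r(r, n):
    """Classical probable error of a correlation: 0.6745 * (1 - r^2) / sqrt(n).

    Assumption: the article does not say which formula was used.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    return 0.6745 * (1.0 - r * r) / math.sqrt(n)


# Height-weight correlation reported for the fifty-eight pupils.
pe = probable_error_of_r(0.822, 58)
print(f"r = .822, probable error = {pe:.3f}")
```

With these inputs the result rounds to .029, matching the printed value. Correlations involving tests that fewer than fifty-eight pupils completed would need the smaller N.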


SOCIO-ECONOMIC STATUS


The pupils were asked to fill out the Sims
Score Card for Socio-Economic Status,² since
this method seemed more reliable and also
more valid than would have been any attempt
to evaluate their home conditions by means
of personal visits to the various homes. The
pupils were taken in small groups, and each
question was read to them and explained.
They were permitted to ask questions freely
in order that they might fully understand
just what was wanted. The object was to
secure as accurate information as possible
from the children themselves. It is not
thought that any of them tried to misrepresent
the facts. They all seemed sincere and
honest in their endeavor to answer the questions
as correctly as possible. While, in the
opinion of the writer, the results obtained by
the use of this score card are influenced somewhat
by subjective elements, yet the method
is far superior to any other instrument thus
far devised.

² Verner Martin Sims, The Measurement of Socio-Economic
Status. Bloomington, Illinois: Public School Publishing
Company, 1928.

The twenty-three questions on each pupil's 
score card have to do with the occupation, 
education, clubs, etc., of his parents; the size, 
the furnishings, and the conveniences of his 
home; how and where he spends his vaca- 
tions; and the frequency and type of amuse- 
ments in which he indulges. The twenty- 
three questions carry a possible total of 
eighty-two credit points. The total score is 
obtained by taking the quotient of the total 
credit points divided by the number of ques-
tions answered (usually twenty-three), car-
ried to one decimal place and the decimal
point dropped. Thus if the total credit is 65
the score is (65 ÷ 23 = 2.8) 28. Sims finds
that the coefficient of reliability for this score
card is .91, and that measured by the best
obtainable standards the score card is valid.
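
The scoring rule just described can be sketched in a few lines. The function name is illustrative, and the use of rounding (not truncation) to reach one decimal place is an assumption; it is the reading that reproduces the scores printed for the six pupils in Table III.

```python
def sims_score(total_credits, questions_answered):
    """Credits per answered question, to one decimal place, decimal point dropped."""
    if questions_answered <= 0:
        raise ValueError("at least one question must be answered")
    # Multiplying by ten before rounding is the same as rounding the quotient
    # to one decimal place and then dropping the decimal point.
    return round(10 * total_credits / questions_answered)


# Worked example from the text: 65 credits on 23 questions.
print(sims_score(65, 23))

# (questions answered, total credits, printed score) for the six pupils in Table III.
table_iii_rows = [(22, 70, 32), (23, 68, 30), (22, 33, 15),
                  (23, 35, 15), (22, 14, 6), (23, 11, 5)]
for answered, credits, printed in table_iii_rows:
    print(answered, credits, sims_score(credits, answered), printed)
```

Dividing by the number of questions actually answered, not by twenty-three, keeps the scores of pupils who skipped a question comparable with the rest.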


Samples of the data collected for these 
pupils are shown in Table III. Three pupils 
did not fill out the score card; two answered 
a total of twenty-one questions; fifteen an- 
swered twenty-two questions each; and each 
of the remaining thirty-eight answered the en- 
tire twenty-three questions. The highest score 
for any one pupil is 32 and the lowest is 5.

TABLE III

STANDINGS OF SIX AND SUMMARY FOR FIFTY-EIGHT
PUPILS OF NORMAL INTELLIGENCE
IN SOCIO-ECONOMIC STATUS

          Questions   Total
Pupil     Answered    Credits   Score

BZ           22         70        32
AX           23         68        30
AD           22         33        15
CC           23         35        15
BQ           22         14         6
AI           23         11         5

Mean                   37.32     16.42
Median                 35.00     15.57
Q.                     10.14      4.53
S.D.                   13.76      6.12
V.                     36.86     37.29
Range                  59.00     27.00


Thus we have a range, in these scores, of 27 
points and a mean score of 16.42; i.e., the 
range is 1.6 times as great as the mean score. 
Sims warns against attaching too much sig- 
nificance to a comparison of local data with 
norms furnished. However, these scores, 
ranging from 5 to 32, with a quartile devia- 
tion of 4.53, and a standard deviation of 6.12, 
certainly show a wide variation in socio- 
economic status. We may sum up the find- 
ings from the data gathered relative to the 
socio-economic status of these fifty-eight nor- 
mal pupils as follows: (1) They differ 
widely among themselves; (2) they vary 
greatly within themselves in the different 
elements of wealth and social standing; and 
(3) there is a slight positive relationship
(r = .286 + .086) between socio-economic 
status and freedom from physical defects. 


GENERAL MECHANICAL ABILITY 


The Stenquist Assembling Tests of General
Mechanical Ability³ were given to the pupils
individually. The publishers of this series of
tests recommend that they be given as a
group test to save time. The writer found,
however, that the price of the series made
group testing prohibitive. The pupils were
called into the office one at a time and given
the tests. Otherwise, the directions were
followed rigidly, and thus the conditions were
kept uniform. Norms for ten-year-old pupils
were not available, and hence the norms for
nine-year-old boys were used. The Stenquist
Assembling Test Series III is designed especially
for boys, but was given to the entire
group. It consists of a plain bolt and nut,
a bolt and wing nut, a plain hinge, a key and
ring, a turn buckle, a drawer pull, a trunk
caster, a plain push button, a belt and buckle,
and a nail clip. These ten articles were
taken apart, and the parts of each put into
a separate compartment in a wooden case.
The task of the pupil was to assemble the
articles in order, being allowed thirty minutes
time for the entire test.

³ John L. Stenquist, Stenquist Assembling Test of General
Mechanical Ability. Chicago, Illinois: C. H. Stoelting Co.

This test was given to all of the boys 
(thirty) and twenty-six of the girls. Table 
IV is an abridged form of the original table. 
The raw scores are used rather than the
T-scores because the only available table of
standardized norms for Series III was apparently
standardized on the raw scores. At
the time the Manual of Directions was published,
1922, Series III had not been standardized,
but it has since been standardized by
Edna Willis McElwee.⁴ McElwee evidently
did not use the T-scores, since the scores in
her table of norms range from 18 to 105.
The possible range of raw scores is from 0 to
120, whereas the possible range of T-scores is 
from 49 to 82. Each of the first eight models 
used in the Stenquist test carries a possible 
credit of ten points, while models I and J 
carry twenty points each. Consequently, 
there is a possible total of 120 points. 
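
The raw-score ceiling follows directly from the credits per model. A minimal sketch, assuming the ten models are lettered A through J as in Table IV; the names `MODEL_CREDITS` and `raw_score` are illustrative and not part of the Stenquist manual.

```python
# Maximum credit per model: ten points for models A-H, twenty for models I and J.
MODEL_CREDITS = {model: 10 for model in "ABCDEFGH"}
MODEL_CREDITS.update({"I": 20, "J": 20})

MAX_RAW_SCORE = sum(MODEL_CREDITS.values())


def raw_score(credits_by_model):
    """Sum a pupil's credits, checking each against the maximum for that model."""
    total = 0
    for model, credit in credits_by_model.items():
        limit = MODEL_CREDITS[model]
        if not 0 <= credit <= limit:
            raise ValueError(f"model {model}: credit {credit} outside 0..{limit}")
        total += credit
    return total


print(MAX_RAW_SCORE)
print(raw_score({"A": 10, "B": 10, "I": 20}))
```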

The data indicate that: (1) The variation
in mechanical ability among boys and
girls of normal intelligence is very great;
(2) girls vary slightly more than boys; (3)
there is no significant relationship between
mechanical ability and freedom from physical
defects (r = -.175 ± .092); and (4) there
is no significant relationship between mechanical
ability and socio-economic status
(r = .1IIT ± .090).


MUSICAL ABILITY


To measure the pupils in musical ability, 
the Kwalwasser-Dykema Music Tests⁵ were
used. For each of the ten tests, a phono- 
graph record was played, and the pupils were 
asked to indicate by certain letters whether 
tones, pitches, rhythms, melodies, etc., were 
alike or different. The scores were trans- 
lated into percentile rankings for comparison 
with standard norms. There is some ques- 

⁴ Edna Willis McElwee, “Standardization of the Stenquist
Assembling Test, Series III,” Journal of Educational Psychology,
XXIII (September, 1932), 451-54.

⁵ K-D Music Tests. Published by Carl Fischer, Inc., Cooper
Square, New York, 1930.








JOURNAL OF EXPERIMENTAL EDUCATION 


| Vol. 0, Ni 


TABLE IV 


SIX 


SCORES FOR 


Pupil Model Model Model Model 
A B C D 


10 
10 


10 
10 


10 
10 


10 
10 


10 2 
10 10 


10 
10 


> 
- 


0 


10 
10 


10 
10 


tion as to how much the tests measure na- 
ture, and how much they measure nurture; 
but so far as we are able to determine, no 
one doubts that they distinguish fairly ac- 
curately the pupil’s musical ability. The ten 
tests in the series include tonal memory, 
quality discrimination, intensity discrimina-
tion, tonal movement, time discrimination,
rhythm discrimination, pitch discrimination, 
melodic taste, pitch imagery, and rhythm 
imagery. The scores are presented in Table 
V. The scores of the ten tests are not 
equated; hence, they cannot be compared 
directly. Moreover, the means vary from 
12.5 to 23.5, so the standard deviations can- 
not be compared directly. Therefore, it 
seems that comparisons of the variations in 
the different parts of the music tests are best 
made by means of the Pearson formula⁶ for
calculating the coefficient of variation. This is

$$V = \frac{100 \times \text{S.D.}}{\text{Average}}$$

and simply amounts to finding the ratio between the S.D.
and the mean, and stating it in terms of
per cent.
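
As a quick check on the formula, the coefficient of variation can be recomputed from the means and standard deviations printed in Table V. The function name is illustrative; because the printed means and standard deviations are themselves rounded, a recomputed V can differ from the printed V in the last digit for some tests.

```python
def coefficient_of_variation(standard_deviation, mean):
    """Pearson's coefficient of variation: the S.D. as a percentage of the mean."""
    if mean == 0:
        raise ValueError("the mean must be non-zero")
    return 100.0 * standard_deviation / mean


# Tonal movement: mean 16.8, S.D. 4.8 (Table V).
v_tonal_movement = coefficient_of_variation(4.8, 16.8)

# Total of the ten tests: mean 165.7, S.D. 15.0 (Table V).
v_total = coefficient_of_variation(15.0, 165.7)

print(f"tonal movement V = {v_tonal_movement:.1f}")
print(f"total score V    = {v_total:.1f}")
```

Expressing the spread relative to the mean is what lets tests with different means, here ranging from 12.5 to 23.5, be compared on one footing.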

If we examine the different V’s in this 
series of music tests, we shall find that the 
lowest is for pitch imagery and is 14.6. To 
state this a little differently, at least one third 
of the scores in each of these tests depart 
from the mean for the test by more than 
14.6% of that mean. If we go, now, to the 
other extreme, we shall find in the case of 

⁶ There has recently developed considerable objection to the
use of the Pearson formula, largely because of the uncertainty
of the zero point. However, no better formula has
been offered, and it is used here for want of a better
formula.
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AND SUMMARY FOR FIFTY-SIX PUPILS OF NORMAL INTELLIGENCE 
IN THE STENQUIST ASSEMBLING TESTS 


Model 


E 


Model Model Model Model Model Tota! 
F G H I J 
20 


20 


0 10 
5 0 


11] 
104 


20 


10 2 
10 


10 


10 
10 


10 


10 
} 10 


6 


0 
0 


Boys 
68 
66 
11. 
16. 
24. 


0 
0 


Girls 
53.7 
52. 
10.9 
13.8 
25.6 


1 
6 


1 
4 
0 
0 
0 
0 


tonal movement that one third of the scores 
are away from the mean by more than 28.6% 
of the mean. When we total all the scores, 
we have a mean that is ten times as large as 
the means of the individual tests. However, 
the high scores in one test offset the low 
scores in another test, and we have a V of
9.1. The V’s of these ten tests indicate that 
there is more variability within the tests 
themselves than the total V shows. In the 
light of these considerations we must con- 
clude that the variability of this group of 
pupils is so great as to warrant individual 
attention to each pupil. 

Furthermore, the r between total music and
physical defects was found to be zero; between
music and socio-economic status it was
.429 ± .084; and between music and mechanical
ability .116 ± .092. These correlations
indicate that there is no significant relationship
between musical ability and any
one of these other traits with the exception
of socio-economic status. Although the r between
musical ability and socio-economic
status is low and would be of no predictive
value, it probably does carry some significance.
If musical ability is partly due to
environment, the better environment that
goes with a better social and economic condition
would doubtless furnish a better opportunity
for the pupil to advance in musical
performance and appreciation.

From the foregoing considerations we may
say that the data collected relative to musical
ability indicate that: (1) Individual differences
in general musical ability are very
great; (2) the same pupil is likely to show
great individual differences in various phases
of musical ability; (3) pupils tend to show
greater individual differences in tonal movement
than in any of the other traits tested;
(4) there is a slight positive relationship between
musical ability and socio-economic
status, which fact probably has some significance;
and (5) the best, and probably the
only, safe way to determine a pupil's musical
ability is to test it directly.

TABLE V

SCORES OF SIX PUPILS AND SUMMARY FOR FIFTY-EIGHT CHILDREN OF NORMAL INTELLIGENCE
IN THE KWALWASSER-DYKEMA MUSIC TEST

Pupil  Tonal  Quality  Intensity  Tonal  Time  Rhythm  Pitch  Taste  Pitch  Rhythm  Total
       Mem.                       Move.                              Imag.  Imag.

BW 20 25 25 25 12 22 29 14 14 17 203
BZ 19 20 25 27 19 19 27 13 14 19 202
BP a” 24 24 12 11 13 24 12 14 18 166
CF 14 7 21 20 13 18 20 13 12 18 166
BV 7 14 19 8 8 19 17 2 7 18 i2 =~ 189
AD 17 8 9 20 11 13 16 12 12 19 137
Mean 15.4 19.6 21.0 16.8 14.3 16.7 23.5 12.5 14.3 15.8 165.7
Md. 15.1 19.5 21.6 16.9 13.4 16.2 23.8 12.8 14.4 15.6 164.3
Q. 21 #17 #219 $5 19 19 24 17 #16 «+8212 88
S.D. 2.8 2.9 3.5 4.8 2.9 2.6 3.9 2.9 2.1 2.7 15.0
V. 18.0 14.9 16.6 28.6 20.6 15.3 16.7 23.1 14.6 7.2 9.1


ACHIEVEMENT IN THE COMMON BRANCHES 


The New Stanford Achievement Test⁷ was
used in this investigation because of its widespread
acceptance; because of its carefully
prepared norms; and because of the fact that
the scores on the ten tests are equated in
value, making it possible to compare a pupil's
scores in the various subjects by comparing
them directly. The norms are so arranged
that they show in parallel columns the standards
for chronological age, for educational
age, and for school grade. The comparisons
in this particular study were made in terms
of the school grade norms. In addition to
the ten common subjects that are measured
by the New Stanford Achievement Test, the
pupil's rate and quality in handwriting were
evaluated in terms of the American Handwriting
Scale. In this manner scores were
obtained for each pupil in paragraph meaning,
word meaning, dictation, language usage,
literature, history and civics, geography,
physiology and hygiene, arithmetic reasoning,
arithmetic computation, and rate and quality
of handwriting. The data gathered from the
results of this test were arranged in tabular
form in Table VI which shows the highest,
lowest, and middle pupils, together with the
usual statistical summary.


* Published by the World Book Company, Yonkers-on-Hudson, New York; and Chicago, 1931.


TABLE VI

SCORES OF SIX PUPILS AND SUMMARY FOR FIFTY-EIGHT CHILDREN OF NORMAL INTELLIGENCE
IN THE NEW STANFORD ACHIEVEMENT TESTS

Pupil   Par.    Word    Total   Dict.   Lang.   Lit.
        Mean.   Mean.   Read.           Usage

[?]‡    62 70 58 62 86
[?]     82      75      78.5    78      62      55
BB      69      61      65      43      44      64
BU      [?]     60      63.5    67      80      52
[?]‡    52 55 44 42 34
BH      27      45      3[?]    51      20      20
Mean    61.6    61.4    61.2    56.4    57.1    51.4
Md.     64.2    62.3    5[?]    56.9    60.8    55.2
Q.      7.9     5.4     5.7     9.8     13.7    17.2
S.D.    12.5    9.0     9.8     13.2    20.0    19.7
V.      20.2    14.7    16.0    23.4    35.0    38.3

TABLE VI (continued; rows in the same order as above)

Pupil   Hist.   Geog.   Phys.   Arith.  Arith.  Total   Total
        and             and     Reas.   Comp.   Arith.  Score
        Civ.            Hyg.

[?]     82      71      76      80      71      76      72.5
[?]     76      84      70      74      67      70.6    72.3
BB      51      67      65      43      60      56.5    58.7
BU      20      48      64      64      65      64.5    58.7
[?]     20      34      24      34      56      45      39.8
BH      20      50      24      34      42      38      33.2
Mean    51.3    58.4    57.1    59.1    60.5    59.8    57.1
Md.     56.7    59.4    62.1    60.2    62.5    61.1    58.4
Q.      18.2    6.5     10.8    9.8     4.4     6.8     8.1
S.D.    20.1    11.8    17.6    13.5    9.5     9.9     9.9
V.      39.1    20.2    30.8    22.9    15.7    16.[?]  17.3

[?] marks a pupil label or a digit that is illegible in the source.
‡ Only five of the six entries in this row are legible; they are given in their printed order, and their columns are uncertain.


If we examine the summary at the close of
the table, we shall find some interesting results.
Since the test scores are equated, we
can compare directly the means in the different
columns and arrive at the “group” differences.
The lowest mean is 51.36 (history
and civics), and the highest is 61.62 (paragraph
meaning), while the mean of the total
scores is 57.15. Accordingly, we find a range
of 10.26 in the mean scores of the various 
tests. This amounts to 17.9% of the mean of 
the total scores. Because of this variation in 
the means of the diverse tests, we again resort 
to the use of the V as a means of comparing 
the dispersions of the ten tests. We find that 
the lowest V (14.76) is in word meaning, and 
that the highest (39.15) is in history and 
civics. In other words, in each of the ten 
tests one third of the group deviate from the 
mean of the group by from 15% to 39% of 
the amount of the mean. The range, more- 
over, in total scores is 39.3, and the highest 
score is two and one fifth times as great as 
the lowest score. 
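
The arithmetic in the preceding paragraph can be retraced with a short sketch. It assumes that V is the coefficient of variation, that is, the standard deviation expressed as a percentage of the mean (an assumption that agrees with the S.D. and V rows of Table VI); the function and variable names are illustrative only and do not come from the study.

```python
def coefficient_of_variation(sd, mean):
    """V: the standard deviation expressed as a percentage of the mean."""
    return 100.0 * sd / mean

# Means quoted in the text: history and civics, paragraph meaning, total score.
lowest_mean, highest_mean, total_mean = 51.36, 61.62, 57.15
mean_range = highest_mean - lowest_mean
range_pct = 100.0 * mean_range / total_mean

# Rounded Table VI entries for word meaning (mean 61.4, S.D. 9.0).
v_word_meaning = coefficient_of_variation(9.0, 61.4)

# Highest and lowest total scores among the pupils listed in Table VI.
score_ratio = 72.5 / 33.2

print(round(mean_range, 2), round(range_pct, 2))
print(round(v_word_meaning, 2), round(score_ratio, 2))
```

With the rounded table entries the V for word meaning comes out slightly below the 14.76 quoted in the text, presumably because the author worked from unrounded values.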

The following correlations between the 
total achievement scores and the other traits 
thus far studied are all negligible. 


Achievement and height                  .077 ± .091
Achievement and weight                 -.144 ± .089
Achievement and physical defects        .008 ± .093
Achievement and socio-economic status   .146 ± .089
Achievement and mechanical ability      .177 ± .088
Achievement and musical ability         .355 ± .083
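
The figure following each coefficient appears to be a probable error. The article does not state how it was computed; the sketch below assumes the classical formula, P.E. = 0.6745(1 - r²)/√N, and assumes N = 51 for musical ability (the number of pupils ranked in that trait in Table VIII). Both the formula and the value of N are assumptions made for illustration.

```python
import math

def probable_error_of_r(r, n):
    """Classical probable error of a Pearson r based on n paired scores."""
    return 0.6745 * (1.0 - r * r) / math.sqrt(n)

# r is taken from the text; n = 51 is an assumption (the number of pupils
# with a musical-ability rank in Table VIII).
r_music, n_music = 0.355, 51
pe_music = probable_error_of_r(r_music, n_music)
print(round(pe_music, 3))
```

Under these assumptions the result rounds to .083, the value printed beside the coefficient for musical ability.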


In summary, the data relative to the 
achievement of these children of normal in- 
telligence indicate that: (1) The variation 
of the group among the ten equated tests is 
considerable; (2) the individual differences 
of the pupils within themselves are great; 
(3) the individual differences of these pupils 
in their rankings in the different achievement 
tests are great with no general trend—except 
the tendency to vary; and (4) the relation- 
ships among the scores on the different 
achievement tests are positive and higher than 
between achievement scores and scores on 
any of the other traits studied, but are con- 
spicuously low even among themselves. 


BEHAVIOR TENDENCIES 


It was recognized from the start that the 
measurement of behavior is not as accurate 
as we should like. Yet, by use of the 
Haggerty-Olson-Wickman Rating Schedules,* 
ratings were secured that are valuable in es- 
timating the conduct of these pupils. The 
schedules are two in number, A and B.
Schedule A is designed to estimate the pupil’s
behavior in terms of overt acts performed,
whereas Schedule B is intended to estimate
his tendencies rather than overt acts. Schedule
B is divided into four divisions which consist
of graphic ratings in intellectual, physical,
social, and emotional traits. The scores are
recorded in terms of totals for Schedule A
and for Schedule B, and for each of the four
divisions of Schedule B. Schedule B is considered,
by the authors of the instrument, as
the more reliable of the two. The actual
ratings of these fifty-eight pupils were made
by their classroom teachers, and the data
were assembled in Table VII.

* Published by the World Book Company, Yonkers-on-Hudson, New York.

In spite of its obvious subjectivity, Schedule
B has, according to its authors, “a reliability
of .92 as obtained from the correlation
between halves of the scale.” The
authors further state that “ratings of the
same pupils, when made by different judges
under somewhat different conditions, commonly
correlated about .60 for elementary
school children.” The originators of the
schedules also warn that there is a “tendency
to emphasize behavior of an aggressive type
and to miss certain non-aggressive types of
nervous or emotional disorder.” Another
caution offered by the makers of these scales
is that the results should be considered more
significant for their relationship among the
members of the local group than for their
approach to standard norms. These limitations
should be kept in mind in any consideration
of the data presented. Even if we
make all the allowance possible for the subjectivity
of the schedules and for the unreliability
of the Pearson formula, a careful study
of the summary in Table VII must still convince
us that the variation of these children
in behavior is very great.

TABLE VII

SCORES OF SIX PUPILS AND SUMMARY FOR FIFTY-EIGHT CHILDREN OF NORMAL INTELLIGENCE
ON THE HAGGERTY-OLSON-WICKMAN BEHAVIOR RATING SCHEDULES

Pupil    Schedule  Schedule  Schedule  Schedule  Schedule  Schedule
         A         B         B         B         B         B
                   Div. I    Div. II   Div. III  Div. IV   Total

AQ       59        31        24        36        38        129
AO       42        25        15        40        46        126
AK       52        17        21        20        15        73
BB       30        15        19        16        20        70
AA       0         11        11        14        13        49
BY       0         12        8         12        13        45
Mean     20.94     18.7      15.1      22.7      21.0      75.64
Median   15.00     17.3      14.2      21.3      20.0      69.32
Q.       12.42     3.7       2.9       3.5       4.3       11.40
S.D.     19.96     5.3       3.9       6.0       7.5       18.89
V.       95.28     28.1      25.8      26.4      35.6      24.97
Range    7[?]      23        16        27        3[?]      84

[?] marks a digit that is illegible in the source.


To continue, the correlations between the
total behavior scores and the scores of each of
the other traits studied are shown by the following
r’s:

Behavior and freedom from physical defects   r =  0.106 ± .090
Behavior and socio-economic status           r = -0.14[?] ± .088
Behavior and mechanical ability              r = -[illegible]
Behavior and total music                     r = -0.079 ± .093
Behavior and total achievement               r = -0.041 ± .089

It is patent that these correlations are all
negative except for freedom from physical
defects, but that none of the correlations is
high enough to have any considerable significance.
We can safely say, then, that there
is no significant relationship between behavior
and any of these other traits.

Finally, the following pertinent conclusions
may be drawn from a consideration of the
data relative to behavior of these fifty-eight
pupils of normal intelligence: (1) Pupils
show a strong tendency toward individual differences,
within themselves, even among the
different phases of behavior; (2) pupils show
a greater variation in emotional tendencies
than in intellectual, physical, or social traits;
and (3) there appears to be no significant relationship
between the pupils’ behavior scores
and their scores in any of the other traits
measured.


INDIVIDUAL PROFILES


The data collected for all these pupils in
the various traits studied were condensed into
a series of profiles designed to give graphic
pictures of each pupil’s standing in the different
characteristics. The arrangement of
the data on the profiles was somewhat arbitrary,
being so arranged as to group similar
data on the same page and to use the same
type of norms as nearly as possible. Each
pupil’s profile consists of two pages, on the
first of which is the information relative to
the New Stanford Achievement Tests and the
American Handwriting Scale. These scores
are compared with the standard grade norms.
On the second page are shown the data for
the Kwalwasser-Dykema Music Test, the
Haggerty-Olson-Wickman Behavior Rating
Schedules, the Stenquist Assembling Tests of
Mechanical Ability, and the Sims Score Card
for Socio-Economic Status; and these scores 
are compared with standard percentile or 
decile norms. All scores on the profiles are 
so arranged that high scores imply desirable 
traits. There is a separate profile of two 
pages for each of the fifty-eight pupils 
studied; but because of the lack of space 
only two of the individual profiles, showing 
the greatest and the least variation, are given 
here. Perhaps no comment on these profiles 
is necessary except to say that of the fifty-eight
pupils tested not one has all of his
achievement scores within the scope of one 
school grade; not one even approximates an 
average score in all the music tests; and only 
one comes within twenty per cent of a uni- 
form normal rating in behavior. 


[Individual profile of Pupil AE, including the Kwalwasser-Dykema Music Test; chart not reproduced.]


RANKING PROFILES


TABLE VIII

RANKINGS OF FIFTY-EIGHT PUPILS IN THE EIGHT TRAITS STUDIED

Pupil  Height   Weight   Freedom  Socio-   Mechan-  Musical  Achieve- Behavior
                         from     Economic ical     Ability  ment
                         Physical Status   Ability
                         Defects

AA*    11†      3        5        11       19       15       9        2
AB     7        9        1        21       34       31       17       48
AC     46       43       53       11       56       14       25       7
AD‡    46 50 30 27 52 31 3
AE*    51       53       23       21       17       8        21       26
AF*    19       16       41       49       10       39       9        36
AG‡    40 39 10 29 50 46 28
AH     19       28       10       34       15       42       1        14
AI     11†      36       52       45       7        3        35       35
AJ‡    19 39 30 49 22
AK‡    7 28 7 17 2 23 48
AL     7        18       10       34       22       35       41       12
AM     51       50       48       30       54       31       33       45
AN     19       35       51       30       55       35       42       19
AO‡    19 27 10 3 19 36 55
AP     3        5        54       45       8        27       11       47
AQ‡    11† 6 48 55 12 55 58
AR     28       37       23       10       3        6        2        54
AS     11†      21       45       30       6        10       32       16
AT     30       14       23       5        3        4        18       45
AU     11†      24       5        17       28       51       47       30
AV     30       27       30       13       36       6        22       33
AW‡    13 17 44 50 10
AX     11†      16       10       2        39       6        30       51
AZ     7        11       30       49       52       42       54       44
BA     46       50       30       40       36       15       40       50
BB‡    [?] 41 32 [?] 28 37
BC     46       42       30       41       46       31       34       9
BD     30       33       39       6        28       40       53       40
BE     30       14       1        [?]      12       12       52       24
BF     28       24       10       34       11       30       7        43
BG     3        11       10       6        23       28       4        27
BH     6        9        7        34       5        19       57       19
BI     11†      6        23       6        26       35       4        29
BJ     30       21       10       30       15       49       26       33
BK     51       49       30       20       39       19       56       6
BL‡    [?] 13 1 25 16 37
BM     30       41       10       34       23       19       24       19
BN     30       21       45       52       32       46       51       42
BO     30       35       41       45       19       15       13       16
BP     18       18       30       17       48       23       39       12
BQ     17       37       23       54       45       12       20       39
BR     30       20       10       41       36       18       3        4
BS     30       33       30       45       46       45       44       10
BT     54       43       48       34       50       48       43       16
BU     41       24       7        13       43       19       28       23
BV     41       43       39       53       28       50       27       56
BW     46       47       10       20       35       1        6        5
BX‡    41 43 41 [?] [?] 12 7
BY     3        6        23       41       46       40       14       1
BZ     2        1        1        1        42       2        19       53
CA     55       53       1        17       40       46       37       24
CB     19       2        45       6        44       11       7        40
CC     40       47       21       27       26       28       45       7
CD     11†      13       10       49       31       25       31       31
CE     19       28       23       4        12       5        14       31
CF*    40       55       1        21       25       23       37       14
CG     1        3        -        21       [?]      35       48       52

Total
Ranks  55       55       54       55       56       51       57       58

* There is a ranking profile given in the present article for each pupil starred in this table.
† Where the same number appears two or more times in the same column the pupils are tied for the rank indicated.
- Where this symbol appears in the table it indicates that there is no record for this particular item.
[?] marks an entry that is illegible in the source.
‡ One or more entries in this row are missing or illegible in the source; the surviving values are given in their printed order, and their columns are uncertain.


As a final step in the endeavor to determine
variations, the fifty-eight pupils were ranked,
from 1 to 58, in height, weight, freedom from
physical defects, socio-economic status, mechanical
ability, musical ability, achievement,
and behavior; and these rankings are assembled
in Table VIII. It is to be noted
that Pupil AA is eleventh from the top in
height; i.e., there are ten taller than he; he
is third from the top in weight, for two pupils
weigh more than he; four have fewer physical
defects than he; ten have a better socio-economic
status than he; eighteen have better
mechanical ability; fourteen are better in
music; eight have a higher total achievement
score; and one is rated by his teachers as
having fewer undesirable behavior characteristics.
This table was arranged to rank the
pupils, in all the traits, so that the lower the 
number in the ranking column the higher the 
rank in desirable characteristics; and contra- 
wise the higher the number the more undesir- 
able are the characteristics. There are fifty- 
eight pupils in our group, so the ranks are 
from 1 to 58. Thus a person ranking 1, 2, 
or 3 would be considered very good in the 
particular trait; one ranking 56, 57, or 58 
would be very poor in the trait; and one 
ranking two or three points either up or down 
from 29 would be about average. In some 
instances two or more pupils are ranked the 
same in the same column; e.g., pupils AA, 
AQ, AU, AX, BI, and CD are all ranked 11 
in height. This is because each of these 
pupils is 54 inches tall, and there are ten 
pupils who are taller. Wherever a dash ap- 
pears in the table it indicates that there is 
no record for that pupil in that particular 
trait. Each dash in the table diminishes by 
one point the possible number of ranks in 
that column, and the total number of ranks 
in each column is shown at the bottom of 
the column. Consequently, fifty-five is the 
lowest possible rank in the height column, 
fifty-five the lowest in the weight column,
fifty-four the lowest in the freedom from
physical defects column, and so on to the last 
column (behavior) which is the only column 
that has the full fifty-eight possible ranks. 
A star following a pupil’s letters in the table 
indicates that his rankings are also shown in
an individual ranking profile. To conserve
space only four ranking profiles are given 
here. 
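
The ranking rule described above (tied pupils share a rank, a pupil’s rank is one more than the number of pupils who surpass him, and a pupil without a record receives no rank) may be sketched as follows. The heights are invented for illustration and are not the study’s data; only the six pupil labels are borrowed from the example in the text.

```python
def rank_with_ties(scores):
    """Rank pupils so that rank = 1 + number of pupils with a better score.

    `scores` maps pupil -> measurement (higher is better) or None when
    there is no record; pupils without a record receive no rank.
    """
    recorded = {p: s for p, s in scores.items() if s is not None}
    return {
        pupil: 1 + sum(other > score for other in recorded.values())
        for pupil, score in recorded.items()
    }

# Hypothetical heights in inches: ten pupils taller than 54, six pupils at
# exactly 54, one shorter pupil, and one pupil with no record.
heights = {f"T{i}": 55 + i for i in range(10)}
heights.update({name: 54 for name in ("AA", "AQ", "AU", "AX", "BI", "CD")})
heights.update({"S1": 53, "X1": None})

ranks = rank_with_ties(heights)
print(ranks["AA"], ranks["CD"], ranks["S1"], len(ranks))
```

The six pupils at 54 inches all receive rank 11, the shorter pupil receives rank 17, and the pupil without a record reduces the number of ranks in the column by one.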


In this list, Pupil AA has the most nearly 
uniform rankings of any pupil. His rankings 
range from 2 in behavior to 19 in mechanical 
ability. AE is perhaps a good example of 
what may be expected of any pupil since he 
shows a wide variation in various character- 
istics. He ranks 8 in musical ability; some- 
what above the mean in freedom from physi- 
cal defects, socio-economic status, mechanical 
ability, achievement, and behavior; and 
rather close to the lower extreme in height 
and weight. AF has a range from 9 to 49,
with four above the mean and four below. 
He reaches all the way from within nine 
points of the top in one trait to within nine 
points of the bottom in another. AX ranks 
below the median in three traits and above 
in five, being within one of the top in socio-economic
status and within seven of the bottom
in behavior. BM has a range of thirty-one,
with three below average and five above.
BO has a range of thirty-two, with four on 
each side of the median. BP, BQ and BR 
have similar (irregular) records. CB varies 
from 2 to 45, with five above mid-point and 
three below; CD has a high rank of 10 in
freedom from physical defects, and a low one 
of 49 in socio-economic status, with four 
above the median and four below; and BJ 
ranks 10 in freedom from physical defects 
and 49 in musical ability, with four above 
median and four below. 
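
The summaries given for individual pupils (the range of a pupil’s ranks and the number of traits on each side of the middle) can be checked with a brief sketch. The midpoint of 29.5 is an assumption based on the fifty-eight possible ranks; the function name and the dictionary keys are illustrative.

```python
def rank_profile(ranks, midpoint=29.5):
    """Summarize one pupil's ranks: spread, and counts on each side of midpoint.

    Lower rank numbers are better, so a rank below the midpoint counts as
    'above average'. Missing records (None) are skipped.
    """
    present = [r for r in ranks if r is not None]
    return {
        "best": min(present),
        "worst": max(present),
        "range": max(present) - min(present),
        "above": sum(r < midpoint for r in present),
        "below": sum(r > midpoint for r in present),
    }

# Rows for pupils BM and BO as printed in Table VIII.
bm = rank_profile([30, 41, 10, 34, 23, 19, 24, 19])
bo = rank_profile([30, 35, 41, 45, 19, 15, 13, 16])
print(bm)
print(bo)
```

For BM this gives a range of thirty-one with five traits above the midpoint and three below, and for BO a range of thirty-two with four on each side, in agreement with the text.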


Even a casual glance at the table and the
profiles is enough to convince one of the 
widespread differences of these pupils within 
themselves in their rankings. A more careful 
study will force one to the conclusion that 
there is little, if any, relationship among a 
pupil’s rankings in these eight different traits 
—except in the case of height and weight. 


CONCLUSIONS 


Great individual differences among children 
of normal intelligence are shown to exist. 
Although children may be of normal intelli- 
gence as indicated by their I.Q.’s, it is clear 
that they are not as a rule homogeneous in
any other traits that have been measured. 
In other words, it is impossible to predict 
where a pupil will stand in other traits from 
the fact that he is normal in intelligence. 
For that reason, if we wish to determine his 
capacities and accomplishments in the differ- 
ent elements of mechanics, music, the com- 
mon branches, behavior, social standing, or 
economic status, we must study him directly 
in the trait in question. Any idea that children
who are “normal” in intelligence are
therefore likely to be uniform in any of the 
other traits is not borne out by the facts as 
found in this investigation. The data further 
show that no pupil who ranked high in any 
one trait was uniformly high in all other 
traits; that no pupil who was low in any one 
trait was uniformly low in all other traits; 
and that no pupil who was “average” in any 
one trait was uniformly average in all other 
traits. It is clear, therefore, that time, 
energy, and expense will be saved, and greater 
accuracy will be obtained by studying the 
child directly rather than by trying to predict 
a trait, or a group of traits, from his normal 
intelligence quotient. 
IMPLICATIONS


Teachers should be trained and given more 
time to work individually with their pupils. 
In addition, they should be made to appreci- 
ate the wide individual variations that are 
likely to occur in their children. They 
should be taught how to observe and diagnose 
the case of each individual child; how to 
determine what should be done after the di- 
agnosis is made, and how to do the things 
that they have found necessary for his normal 
development. One difficulty in the past has 
been that teachers and administrators have 
believed in a superficial way that individual 
attention should be given to the children 
under their care, but they have not been 
trained sufficiently to give this individual at- 
tention. Again, many teachers are held for 
routine work, are overburdened with classes 
that are too large, or are limited to “stand- 
ardized” outcomes set by school authorities 
which prevent them from giving the indi- 
vidual attention that is obviously needed. 

The individual method used in the Win- 
netka Schools seems more sound in theory 
than do such schemes as the X, Y, Z grouping 
plan. The data gathered in this study fur- 
nished no evidence that there is an “average” 
child. The Winnetka plan takes care of the 
pupils individually in their regular school 
studies and also provides in the group activi- 
ties for their social development. The X, Y, 
Z idea would place these fifty-eight pupils of 
“normal” intelligence in the “Y” group and 
assume that they are approximately homo- 
geneous—which is not the case. Theoreti- 
cally, then, some such plan as the one used at 
Winnetka provides for the individual needs 
of the child to a much greater extent than 
does the X, Y, Z method; since thereby chil- 
dren of “normal” intelligence, who differ 
among themselves as truly as do superior and 
dull children, are also given individual 
attention. 
SOLVING PROBLEMS OF ACTION: A PROPOSED PROCEDURE
FOR THE SOLUTION OF PRACTICAL PROBLEMS 


P. A. MAXWELL 
Peru State Teachers College, Nebraska 


Much of overt life activity consists of the 
expression of habit patterns that have come 
to be regarded as satisfactory modes of re- 
sponse to types of situations that recur fre- 
quently. However, adjustments often are 
made in such patterns of behavior as new 
elements appear in the stimulating situations. 
Sometimes these adjustments are made be- 
cause an individual becomes disturbed over 
the efficacy of his acts and pauses to reflect, 
plan, and evaluate before acting overtly. At 
other times, especially in the lives of less 
thoughtful persons, such learnings may result 
from heedless trial and error overt responses 
or from uncritical acquiescence to suggestion 
which, like as not, is surcharged with emo- 
tional appeal. 

Behavior patterns that are intelligent, wise, 
and in the long run most efficacious are those 
acquired by the reflective route. Intelligent,
capable, self-directing persons are those who
recognize the significant problems of action 
that confront them, who spare time when 
possible to study and reflect upon such prob- 
lems, and who have become proficient in the 
techniques and mental processes involved in
such study and reflection. To the degree,
then, that intelligent, capable, self-directing 
persons are desired as products of education, 
guidance should be offered in the recognition 
and solution of significant types of practical 
or action problems. Obviously such guidance 
to be effective must provide the learners with 
adequate problem solving techniques. How- 
ever, not a great deal has been done to per- 
fect techniques for solving the action type 
of problem. To overcome this handicap and 
to provide his education classes with a pro- 
cedure for solving teaching problems, the 
writer has developed the problem solving 
technique presented in this paper. For three 
years the assignments in all of his courses 
have been in the form of “problacts,” a term 
coined by contraction of the phrase “problem 
of action.” 

The term problact is significant, for the 
recognition of the distinctive nature of the
action problem is an important step in the
development of a satisfactory solution tech- 
nique. In a problem of this type a person 
is disturbed and hesitates to act overtly, be- 
cause he is uncertain what action is best or 
most appropriate. Thus the problact belongs 
to that class of problems which require the 
use of evaluation for their solution, and which 
may be distinguished from problems of the 
fact or science type. Abelson¹ and Monroe
and Engelhart² have recognized the need for
making this distinction in the field of educational
research. The former refers to problems
requiring a consideration of values as
“evaluative problems,” the latter as “problems
of purposes.” Dewey, too, has stressed
such a distinction by devoting a chapter of 
The Quest for Certainty to a discussion of 
“The Construction of Good.” The chapter 
contains the following statement: 

“The time will come when it will be found 
passing strange that we of this age should 
take such pains to control by every means at 
command the formation of ideas of physical 
things, even those most remote from human 
concern, and yet are content with haphazard 
beliefs about the qualities of objects that 
regulate our deepest interests; that we are 
scrupulous as to methods of forming ideas 
of natural objects, and either dogmatic or 
else driven by immediate conditions in framing
those about values.”³

Another distinguishing feature of the problact
is its comprehensive nature, which serves
to distinguish it from other kinds of evaluative
problems and from problems of the
science type as well. Symonds⁴ has recognized
this in a classification of problems in
which conduct problems are used to illustrate
the “complete problem.” Fact and simple
value problems tend to arise incidentally in
the course of an attack upon a problact.
Indeed, except in scientific research, concern
for a question of truth or value is likely to
be attributed to “idle curiosity,” unless it
arises in connection with a problact.

¹ Harold H. Abelson, The Art of Educational Research, pp. 274-77. New York: World Book Co., 1933.

² Walter S. Monroe and Max D. Engelhart, The Scientific Study of Educational Problems, pp. 411-12. New York: Macmillan Co., 1936.

³ John Dewey, The Quest for Certainty, p. 268. New York: Minton, Balch and Co., 1929.

⁴ Percival M. Symonds, Education and the Psychology of Thinking, pp. 8-10. New York: McGraw-Hill, 1936.

In general outline the proposed problact-solving
procedure resembles that commonly
employed in effective thinking about problems
of all sorts. It involves: (1) an analysis
and definition of the problem, (2) formulation
of hypotheses, (3) testing or evaluation
of the hypotheses, and (4) conclusion, or decision
to follow the most promising hypothesis.
It has been necessary, however, to devise
many of the specific steps of the procedure
in order to meet certain requirements
peculiar to problact solving. This is true in
particular of the evaluation technique, in
which the hypotheses are compared in terms
of their predicted consequences.⁵ The principal
difficulty with this technique has been
encountered in the translation of the predicted
consequences into quantitative expressions
of value and the synthesis of the values
thus expressed. Abelson⁶ has held that “in
most situations” this cannot be done, whereas
Symonds⁷ has suggested that when this point
is reached one should turn his attention to
something else or sleep on the matter, presumably
to complete the evaluative process
subconsciously. In developing the problact-solving
procedure described below, it is believed
that a measure of success has been
attained in overcoming this difficulty.

Step 1. Analysis of the Situation. The
first step in the proposed problact-solving
procedure consists of a careful analysis of
the problactic situation. This is accomplished
by noting: (1) the general field of
activity in which the situation is found:
vocation, play, civic participation, home and
family participation, the toilet, personal
finance, etc.; (2) the particular activity,
function, or duty involved: preparation of
meals, curriculum adjustment, purchase of
clothing, orchestra practice, and the like;
(3) the disturbing elements of the situation:
a recommendation from a respected source,
a threat from a feared source, a sudden drop
in personal efficiency, a stimulus arousing a
new ambition, a stimulus that has come to
serve as a danger signal, and other similar
factors; and (4) significant conditioning factors:
all elements of the situation likely to
have an important bearing upon the nature
of the outcomes when a response to the situation
is made.

⁵ For an exposition of this method of evaluation see Symonds, op. cit., Chapter XV on “Problem Solving” and Chapter XXIV on “Thinking About Conduct.”

⁶ Op. cit., p. 282.

⁷ Op. cit., p. 159.

The analysis provides the groundwork for 
the other phases of the procedure. While in 
the main it should be worked out first, the 
search for significant conditioning factors 
may be continued after the formulation and 
evaluation of hypotheses have begun. 

Step 2. Formulation of Hypotheses.
Promising problact hypotheses in the form of
plans of action may be created from elements
of the past experience of the subject or of
others. Often helpful suggestions may be
obtained from appropriate literature. These
may have the advantage of representing a
number of differing points of view. Usually
from two to four hypotheses are a satisfactory
number, but to have sufficient range a
set of hypotheses should represent all important
points of view. To be relevant a hypothesis
must promise to remove or ameliorate
the disturbing elements of the situation.

Step 3. Evaluative Comparison of the
Hypotheses. The discussion of this step is 
divided into three parts. 

a. Formulation of Criteria of Value. 
When a satisfactory set of hypotheses has 
been formulated, the hypotheses are tested or 
evaluated in order to determine the relative 
merit or value of each. There are two pos- 
sible methods of judging the merit of an act 
or series of acts. One consists of determining 
the degree of conformity of the act to prin- 
ciples or precepts that are considered sound. 
The other consists of determining the worth 
or desirability of the fruits or consequences 
of the act—probable consequences in the 
case of the hypothetical courses of action in 
problact solving. The latter method has 
been selected for our problact-solving procedure.
The precept method is considered to
be but a makeshift procedure for use when 
time or other resources are insufficient to 
make the more thorough method of conse- 
quences practicable. The principal drawback 
to a general rule of action is its failure to 
make allowance for the many variations that 
are certain to exist in the class of situations 
to which the rule is supposed to apply. 

The evaluation of a set of hypotheses con- 
sists, then, of comparing them in terms of the 
worth of their probable consequences. Now 
the worth or value of anything depends upon
the amounts of desirable qualities it possesses. 
Therefore the criteria of value employed in 
comparing problact-solving hypotheses are 
variable qualities of objects likely to be af- 
fected by the hypothetical responses. This 
point is worthy of emphasis, for the common 
practice of referring to desired objects as 
values has been a serious obstacle to the de- 
velopment of effective evaluative techniques. 
Values always may be expressed as variable 
qualities of objects, that is, in measurable 
form and must be so expressed to serve as 
satisfactory problact-solving criteria. 

Qualities employed as criteria of value in 
problact solving are relevant if their amounts 
depend somewhat upon the decision to be 
made. They are significant to the degree 
that they are prized for their own sake or 
condition other qualities so regarded. Good 
problact-solving criteria are both specific and 
generic. They are specific in that they are
particular, definite values into which goodness
in general may be analyzed. The more spe- 
cific in this sense, the more accurate will be 
the predictions of hypothetical consequences. 
Good criteria are generic in the sense that 
each serves to measure the consequences of 
all the hypotheses formulated for a problact. 
Each is a quality of an object that will be 
present in some ensuing situation regardless 
of which hypothesis is followed. Like promis- 
ing hypotheses, relevant and significant cri- 
teria may be discovered by recalling criteria 
used previously in the solution of similar 
problacts or by looking for suggestions in 
appropriate literature. 

b. Organization of a Criterial Hierarchy. 
When a number of values have been selected 
to serve as criteria in the solution of a pro- 
blact, it frequently is found that a means-end 
relationship exists between some of them. 
Indeed, series or strands each consisting of a 
number of the criteria usually are found to 
exist, the status of each member of a series 
depending somewhat upon the status of the 
preceding or conditioning member. For ex- 
ample, the solution of a problact concerned 
with the beautification of the grounds sur- 
rounding my home might involve the follow- 
ing strand: 


Height of beauty 
of the landscape 


in question toward me 


SOLVING PROBLEMS OF ACTION 


Friendliness of 
Neighbor X 


103 


As a rule criterial strands to be most useful 
must begin with immediate criteria and ex- 
tend through intermediate criteria to ultimate 
criteria. immediate criteria are expressed in 
terms of the immediate consequences of the 
choice of a plan of action. They are quali- 
ties either of the hypotheses themselves or of 
objects affected directly by the hypotheses, 
and they should depend entirely or chiefly 
upon the choice made. The advantage of 
employing immediate criteria lies in the fact 
that the estimates of amounts likely to follow 
the supposed adoption of each hypothesis are 
relatively easy to make in their case; and, 
as will be seen later, such estimates provide 
a foundation for the prediction of conse- 
quences in terms of intermediate and ultimate 
criteria. 

Intermediate criteria provide a medium 
through which the probable consequences of 
hypotheses may be traced to ultimate criteria. 
The latter are the final tests of value em- 
ployed in the solution of a problact. Usually 
they are comprehensive values that are con- 
sidered to be of paramount importance, and 
regarding the desirability of which there is a 
high degree of certainty. It is in this cer- 
tainty that the value of the ultimate criteria 
lies, for by virtue of this feature the ultimate 
criteria provide a_ satisfactory basis for 
weighting, an essential step in the evaluation 
process. 

When it is thought that all the significant 
strands of relevant criteria have been set up 
for the evaluation of a set of hypotheses, they 
may be combined into a unified evaluation 
instrument in the form of a criterial hier- 
archy. Usually it is found that some of the 
criteria employed are common to two or more 
of the strands. Such strands should be 
joined together by means of their common 
elements in order to bring into relief signifi- 
cant interrelationships. 

c. Computation of Hypothetical Ratings 
and Value Indices. The actual comparison 
of problact-solving hypotheses begins with the 
estimation of the amount of each immediate 
criterion that is likely to exist following the 
supposed adoption of each hypothesis. The 
estimates are expressed as percentages of the 
optimal amount of each criterion. This is
the ideal or best possible amount, the amount
that is most favorable to the existence of de- 
sirable amounts of the dependent higher order 
criteria and that at the same time is possible 
under the limitations imposed by nature. 
The estimated percentages of the immediate 
criteria are spoken of as immediate hypotheti- 
cal ratings. 

Hypothetical ratings in terms of intermedi- 
ate and ultimate criteria are derived step by 
step from the immediate ratings, beginning 
with the intermediates, or at times ultimates, 
that are directly conditioned by the immedi- 
ates. First an estimated optimal amount of 
each criterion is divided into three parts 
which are expressed in percentage of the 
optimum. The first part is a predicted in- 
dependent base, the amount of the criterion 
that, owing to influences that are extraneous 
to the problact, will exist regardless of what 
is done by the problact solver in response to 
the problactic situation. The second part
consists of one or more maximal contingent
increments, which are the amounts that will
be added to the independent base if the con-
ditioning criteria are optimal in amount.
The third part is the minimal deficiency, the
portion of optimum that is unattainable due
to influences that are extraneous to the
problact.

The final step in computing the hypotheti- 
cal ratings for each intermediate and ultimate 
criterion consists of determining the portion 
of the maximal contingent increment, or in- 
crements, that probably will accrue from each 
hypothesis. When an intermediate or ulti- 
mate criterion is considered to be in direct 
proportion to a conditioning criterion, the 
hypothetical increments may be computed by 
multiplying the maximal contingent increment 
by the respective hypothetical ratings of the 
conditioning criterion. This is the case in 
the following illustration, taken from the
strand previously mentioned, in which the
hypothetical ratings (A and B) of the con- 
ditioning criterion are 75 per cent and 25 
per cent; the predicted independent base, 80 
per cent; the maximal contingent increment, 
10 per cent; the minimal deficiency, 10 per 
cent; and the computed hypothetical in- 
crements, or ratings, 7.5 per cent and 2.5 
per cent: 
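
The arithmetic of this illustration can be restated as a short Python sketch. The function and variable names are hypothetical and are introduced only for illustration; the percentages are those given in the text, and direct proportion between the dependent and the conditioning criterion is assumed, as the text stipulates for this case.

```python
def hypothetical_increments(max_increment, conditioning_ratings):
    """Scale the maximal contingent increment by each hypothesis's
    rating on the conditioning criterion (ratings given in per cent)."""
    return {
        hypothesis: max_increment * rating / 100.0
        for hypothesis, rating in conditioning_ratings.items()
    }


# Three parts of the optimal amount, in per cent of the optimum.
independent_base = 80.0
max_increment = 10.0
minimal_deficiency = 10.0

# The three parts together make up the whole optimum.
assert independent_base + max_increment + minimal_deficiency == 100.0

# Ratings of the conditioning criterion under Hypotheses A and B.
increments = hypothetical_increments(max_increment, {"A": 75.0, "B": 25.0})
print(increments)
```

With the figures from the text, the computed increments for Hypotheses A and B come to 7.5 and 2.5 per cent respectively.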

In case there are two or more conditioners 
that vary proportionately with the dependent 
criterion and which influence it independently, 
partial hypothetical increments are derived 
from each one separately by the above pro- 
cedure and then added together to form the 
complete hypothetical increments. However, 
if the influence exerted by one conditioning 
criterion depends upon the influence exerted 
by another, one maximal contingent incre- 
ment and complete hypothetical increments 
are predicted for the two acting together, the 
hypothetical increments being estimates 
rather than computations. Below is an illus- 
tration of such a case, the predicted 42 per 
cent rating for Hypothesis A being based 
upon the combination of the two “A” ratings,
80 and 100, of the conditioning criteria, while 
the 33 per cent rating for Hypothesis B is 
based upon the corresponding combination of 
“B” ratings. The predictions in this case 
cannot be made separately, for the reason 
that speed may be so low that economy will
not matter, and likewise economy may be so 
low that speed will not matter. 

While experience has shown that criteria 
as a rule may be expressed in such form that 
dependents and conditioners vary in propor- 
tion, it is conceivable that other relationships 
might at times exist between them. In such 
cases the computation technique or method 
of estimate may be adapted to the relation-
ship that does exist. Likewise it seems un-
necessary as a rule to express criteria in nega-
tive form, but if that should be found
desirable, the ratings may be expressed as
negative quantities.

[Figure: illustration of two interdependent conditioning criteria, one of them labeled "Maximum speed of my car." The ratings shown are A:80, B:60 and A:100, B:80.]

When hypothetical ratings have been de- 
rived for all the ultimate criteria of a criterial 
hierarchy, they are synthesized to form value 
indices for the several hypotheses. The first 
step in the synthesis consists of rating the 
ultimate criteria in importance by assigning 
a weight factor to each. The unit employed 
is the weight attached to the criterion rank- 
ing lowest in importance or value. By multi- 
plying each weight factor by the hypothetical 
ratings of the corresponding ultimate criterion 
partial value indices are determined for the 
respective hypotheses. The sum of the par- 
tial indices determined for each hypothesis 
is the index of value for that hypothesis. 
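
The synthesis just described can be sketched in a few lines of Python. The criterion names, weights, and ratings below are hypothetical values chosen only to show the arithmetic; they do not come from the article. The last line anticipates the rule of adopting the hypothesis with the highest index.

```python
def value_indices(weights, ratings):
    """Sum weight * rating over the ultimate criteria for each hypothesis.

    weights: {criterion: weight factor}, the least important criterion = 1
    ratings: {hypothesis: {criterion: hypothetical rating in per cent}}
    """
    return {
        hypothesis: sum(weights[c] * by_criterion[c] for c in weights)
        for hypothesis, by_criterion in ratings.items()
    }


# Hypothetical example: two ultimate criteria, two hypotheses.
weights = {"criterion_1": 1.0, "criterion_2": 3.0}
ratings = {
    "A": {"criterion_1": 80.0, "criterion_2": 50.0},
    "B": {"criterion_1": 60.0, "criterion_2": 70.0},
}

indices = value_indices(weights, ratings)
best = max(indices, key=indices.get)  # hypothesis with the highest index
print(indices, best)
```

In this made-up case Hypothesis B ranks first even though A rates higher on the first criterion, because the second criterion carries three times the weight.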

Step 4. Conclusion. Coming to a conclu- 
sion in problact solving consists merely of 
adopting the course of action—hypothesis— 
having the highest index of value. Obvi-
ously, the decision should be considered ten-
tative, for the high ranking hypothesis may
be considered good only in a relative sense.
Whether it is the best possible or ideal policy 
has not been and cannot be determined. 

Moreover, it should be clearly recognized 
that there are many chances for error in the 
solution of a problact. The confidence
placed in a selected policy should depend 
upon the thoroughness with which it has been 
possible to analyze the situation and the 
certainty of the facts upon which the evalua- 
tions have been based. As a policy is carried 
into action its deficiencies likely will become 
apparent as they give rise to new problacts. 

The procedure just outlined has been used 
for some time in education courses that are 
designed to prepare prospective teachers to 
solve teaching problacts. It is recommended 
for use in other fields of vocational educa- 
tion, guidance, social science, and in any 
other fields that are concerned with develop- 
ing proficiency in the solution of action, con- 
duct, or practical problems. Probably the 
procedure should be simplified somewhat if 
it is used at the elementary school level. 











METHODS OF TEACHING APPLIED TO JUNIOR BUSINESS 
TRAINING IN PHILADELPHIA* 


HAROLD B. BUCKLEY
Supervisor of Commercial Education, 
Philadelphia Public Schools 


I. INTRODUCTION 


The Problem.—What are the relative dif- 
ferences in efficiency of four different methods 
of teaching when applied to junior business 
training in the junior high schools and eve- 
ning high schools of Philadelphia? 
The methods of teaching in this study are 
called: 
1. The Blackboard and Discussion Method. 
2. The Lesson-Sheet and Textbook- 
Reading Discussion Method. 

3. The Blackboard and Textbook-Reading 
Discussion Method. 

4. The Unit Assignment Method. 


Three experiments were conducted to solve 
the problem of this study. 

Experiment 1 compares Method 1 with 
Method 2 when both are applied to the 
teaching of individual lessons during a term. 

Experiment 2 compares Method 1 with 
Method 3 when both are applied to the 
teaching of individual lessons during a term. 

Experiment 3 compares Method 1 with 
Method 4 when both are applied to the teach- 
ing of the units, or series of related lessons, 
in the course of study during a term. In 
each experiment Method 1 is used with the 
control group. 

The essential differences in the methods 
are as follows: 

Method 1—The Blackboard and Discus- 
sion Method uses blackboards for developing 
in the classroom each of the four divisions of 
the individual lesson. The discussion method 
without textbook reading in the classroom is 
used for the presentation part of the lesson. 

Method 2—The Lesson-Sheet and 
Textbook-Reading Discussion Method uses 
mimeographed lesson-sheets for developing in 
the classroom each of the four divisions of 
the individual lesson. The discussion method
with textbook reading in the classroom pre-
ceding the discussion is used for the presen-
tation part of the lesson.

* Doctorate dissertation, Temple University, 1937.

Method 3—The Blackboard and Textbook- 
Reading Discussion Method uses blackboards 
as in Method 1. The discussion method with 
textbook reading in the classroom preceding 
the discussion is used as in Method 2. 

Method 4—The Unit Assignment Method 
uses a unit or series of mimeographed lesson- 
sheets which are assigned to the pupils for 
independent work and textbook reading after 
the complete assignment has been discussed. 
This method differs from Method 2 in that 
the pupils work at their own rates of speed 
through the lesson-sheets. In this way the 
brighter pupils not only finish enrichment 
material at the end of each lesson-sheet, as 
they did in Method 2, but also are enabled 
to progress more rapidly than the other 
pupils through successive lessons. The 
bright pupils use the time saved to review 
the work of the unit as a whole. 

The following is a detailed description of 
each method: 


1. The Blackboard and Discussion Method. 
This method is used with the control group 
in each experiment. Each daily lesson in the 
classroom in which this method is used con- 
sists of four divisions: 


A. Preparation or approach¹
This, the first division of each individual
lesson, the pupils are taught to call the
Self-reliance Work. The device was
originated by the writer in 1926, chiefly
for the purpose of using to best advantage
the first five minutes of a lesson period
which are usually wasted. By this de-
vice, instead of waiting until the pupils
are quietly seated in the classroom before
giving orally the usual lesson-preparation
questions, two questions are written on
the blackboard before the class arrives,
as follows:


Self-reliance Work ----.

(1) A review question which requires
for its answer a knowledge of the
essentials of the previous lesson.

(2) A leading question which is apper-
ceptive and which motivates the
learning of the new lesson.

¹ John G. Kirk, Harold B. Buckley, and Mary A. Waesche,
Introduction to Business Manual and Key, p. 11. Phil-
adelphia: The John C. Winston Co., 1933. Pp. vi + 250.


Each pupil independently answers ques- 
tion 1 in his notebook as soon as he takes 
his seat, while the teacher records at- 
tendance and does any other necessary 
work usually required at the beginning 
of a lesson period. The device is used 
in such a way that the class begins to 
finish the written work in about five min- 
utes. The first pupil finishing question 1 
automatically takes his work to the 
teacher and, if approved, that pupil be- 
comes Class Leader, who at the front of 
the room does the following things: 


a. gives place numbers to the next nine 
pupils who finish question 1 satisfac- 
torily. The pupils record their place- 
number on a short line after the words 
Self-reliance Work ----. 

b. conducts a discussion about the cor- 
rect answer to question 1 and asks 
some pupil to write the answer on the 
blackboard for the class. 

c. conducts a three-minute socialized
recitation in which are used questions 
that the pupils have already prepared 
as part of their homework. 


At this point, the teacher relieves the 
class leader and: 


a. conducts the marking and collecting of 
homework by the pupil. 

b. conducts the return of the pupils’ 
homework of the preceding lesson. 


c. discusses and demonstrates at the 
blackboard how to mark the writing 
of question 1 for one of fourteen com- 
mon handwriting errors. 


d. calls for answers to question 2 and 
uses the most desirable answers to 
lead to some business problem, the 
solution of which is the essential fea- 
ture of the presentation part of the 
lesson. 


B. Presentation or development²

This, the second division of each indi- 
vidual lesson, the pupils are taught to call 
the Problem, because it consists of solving 
some problem related to the pupils’ busi- 
ness lives, such as “How should I write a 
business letter of inquiry?” 

The Blackboard and Discussion 
Method is primarily a method of presenta- 
tion in which the problem of the lesson 
is solved by a class discussion conducted 
by the teacher. In this method, textbook 
information relating to the solution of 
the lesson-problem is not read in the class- 
room. However, it should be remem- 
bered that in this method, as in all other 
methods in this study, textbook reading is 
included as part of the homework. 

The discussion is followed by the de- 
velopment at the blackboard of a brief, 
informative outline, diagram, or model of
a business form, which is copied by the 
pupils into their notebooks for future 
reference. 


C. Application or conclusion in the classroom³

This, the third division of the lesson, the 
pupils are taught to call the Practice 
Work. It is an assignment which is re- 
lated to the problem of the lesson and 
may consist of answering questions, pre- 
paring questions, filling in business forms, 
demonstrating a learned skill, or dramatiz- 
ing a business situation. In order to rec- 
ognize individual differences, the teacher 
encourages the “fast pupils” who finish
ahead of the “average pupils” to assist
the “slow pupils.” When the “average
pupils” finish, the teacher assigns one of 
the “fast pupils” to explain the finished 
work before the class. From this ex- 
planation, the “slow pupils” learn how to 
do the assignment correctly. 


D. Application at home⁴


This, the fourth division, which is as- 
signed in the classroom and done at home, 
the pupils are taught to call the Home- 
work. It consists of three levels of work: 
(1) For a mark of F, fair, the pupil pre- 
pares one multiple-choice (four ending) 
objective question which deals with the 
problem of the lesson learned in the class-
room and is to be used in the socialized
recitation in the next lesson period; (2)
for the mark of G, good, the pupil does
the F assignment and reads the textbook
information that is related to the problem
of the lesson learned in the classroom,
records his reading time, and prepares
two more multiple-choice questions; and
(3) for a mark of E, excellent, the pupil
does the F and G assignments and some
creative work showing complete mastery
of the lesson. The homework is so ar-
ranged that it requires no more than fif-
teen minutes as a minimum to do assign-
ments (1), (2), and (3) and to earn a
mark of excellent.

² Ibid., p. 12.
³ Ibid., pp. 13-14.
⁴ Ibid., pp. 14-15.


The time allotment for a lesson period in 
the classroom is approximately as follows: 
(1) The first third of the period for the 
preparation or approach (i.e., Self-reliance 
Work); (2) the second third of the period 
for the presentation or development (i.e., 
Solving the Problem); and (3) the last third 
of the period for the application in the class- 
room (i.e., Practice Work) and assignment of 
the application at home (i.e., Homework). 
In this study the effective lesson period aver- 
ages about thirty-nine minutes. 

2. The Lesson-Sheet and Textbook-Read- 
ing Discussion Method. This method is the 
same as Method 1 except that: 


A. a mimeographed sheet containing the 
four divisions of the complete daily les- 
son is given to each pupil at the beginning 
of each period. The lesson-sheet elimi- 
nates much blackboard writing by the 
teacher and notebook writing by the 
pupil, as in Method 1. 


B. at the beginning of the presentation part 
of the lesson, textbook reading takes place 
before the discussion starts. It is silent 
reading and is arranged so that it requires 
for most lessons about two or three 
minutes and never more than five minutes 
of the time allotted for the presentation 
part of the lesson. 


C. instead of copying an outline from the
blackboard after the discussion, the pupils 
fill in a series of incomplete-type state- 
ments which are contained on the mime- 
ographed sheet beneath the problem of 
the lesson. The words needed to fill in 
the incomplete statements really take the 
form of an outline. 







3. The Blackboard and Textbook-Reading 
Discussion Method. This method is the 
same as Method 1 except that textbook read- 
ing takes place at the beginning of the pres- 
entation part of the lesson before the dis- 
cussion starts, as in Method 2. 

4. The Unit Assignment Method. The 
course of study during a term consists of a 
number of units or topics like “The Business 
Letter,” “The Telephone,” “Telegrams, 
Cablegrams, and Radiograms,” etc.⁵

Each unit includes a series of individual 
lessons which are related to the same topic. 
For instance, the unit on The Business Letter 
includes five individual lessons, such as, 
“How should I write the parts of a business 
letter?” “How should I write a_ business 
letter of inquiry?” etc. 

Each unit consists of five parts: 


A. Pretest—This consists of fifty multiple- 
choice (four ending) questions which 
deal with essential facts to be learned in 
the unit. A pretest in junior business 
training is possible because the facts re- 
late to common daily business experiences. 

B. Preview lesson.—This aims to give the 
pupils an orientation and overview of the 
unit to be studied and consists of the four 
divisions of an individual lesson. 


C. Individual lessons.—These were previ-
ously described in detail.

D. Review lesson.—In this the presentation
and application in the classroom consist
chiefly of: (1) Arranging the finished 
work of the unit into a notebook report 
which includes a table of contents and 
cover sheet bearing the title of the unit 
and (2) conducting a socialized recitation 
on the unit. 

E. Review test.—This might also be called
the end test. It is the same test that was
given as the pretest. In this study, the 
difference between the end-test score and 
the pretest score is spoken of as the 
Achievement or Number of Answers 
Learned. 


The unit assignment method is similar to
Method 2 except that all the lesson-sheets
for the unit are distributed during the pre-
view lesson. These lesson-sheets are dis-
cussed and then assigned as a whole unit for
independent work in the classroom. The
number of days allotted for independent work
on the unit is equal to the number of indi-
vidual lessons in the unit.

⁵ John G. Kirk, Harold B. Buckley, and Mary A. Waesche,
op. cit., pp. 22-35.


DESCRIPTIVE SUMMARY OF THE METHODS

Activity: Introduction to the unit
Method 1: Pretest. Preview lesson: (1) Blackboard outline; (2) Discussion without text reading in class.
Method 2: Pretest. Preview lesson: (1) Mimeographed lesson-sheet; (2) Discussion preceded by text reading in class.
Method 3: Pretest. Preview lesson: (1) like Method 1; (2) like Method 2.
Method 4: Pretest. Preview lesson: (1) Unit set of mimeographed lesson-sheets; (2) like Method 1.

Activity: Presentation of new material
Method 1: (1) Preparation questions and Socialized Recitation; (2) Problem stated on blackboard; (3) Discussion without text reading in class; (4) Blackboard outline copied into notebooks.
Method 2: (1) like Method 1; (2) Problem stated on lesson-sheet; (3) Discussion preceded by text reading in class; (4) Filling in incomplete statements on lesson-sheets.
Method 3: (1) like Method 1; (2) like Method 1; (3) like Method 2; (4) like Method 1.
Method 4: (1) Preparation questions only; (2) like Method 2; (3) No discussion, but text reading independently; (4) like Method 2 except it is done independently.

Activity: Fixation of new material, A. In class
Method 1: (1) Answering questions, oral or written, prepared by teacher or assigned in textbook; (2) Preparing objective questions; (3) Filling in business forms; (4) Demonstration of specified skills; (5) Dramatization of business situations.
Method 2: (1), (2), (3), (4), and (5) like Method 1.
Method 3: (1), (2), (3), (4), and (5) like Method 1.
Method 4: (1), (2), and (3) like Method 1; (4) and (5) omitted.

Activity: Fixation of new material, B. At home
Method 1: (1) Preparing objective questions; (2) Reading textbook; (3) Collecting and preparing illustrative material.
Method 2: (1), (2), and (3) like Method 1.
Method 3: (1), (2), and (3) like Method 1.
Method 4: (1), (2), and (3) like Method 1.

Activity: Conclusion of the unit
Method 1: Review lesson: (1) Preparing notebook report; (2) Socialized recitation. End test.
Method 2: Review lesson like Method 1. End test.
Method 3: Review lesson like Method 1. End test.
Method 4: Review lesson like Method 1. End test.


TIMELINESS


The purpose of the study is to solve cer-
tain administrative and supervisory problems
encountered by the writer as supervisor of
commercial education in the day and evening
high schools of Philadelphia.


The purposes of Experiment 1 are:

(1) To compare our present method of teach-
ing individual lessons in the day junior
high schools by the blackboard and dis-
cussion method with our present method
of teaching them in the evening high
schools by the lesson-sheet and textbook-
reading discussion method.

(2) To determine the advantages, if any, of
lesson-sheets which have been used in
junior business training in the evening
high schools of Philadelphia for the past
five or more years.


The purposes of Experiment 2 are:

(1) To compare our present method of
teaching individual lessons in the day
junior high schools by the blackboard
and discussion method, which omits
textbook-reading in the classroom, with
the proposed method of teaching them
by the blackboard and textbook-reading
discussion method, which includes text-
book-reading in the classroom before the
discussion.

(2) To determine whether it is educationally 
better to include or exclude the reading 
of the textbook in the classroom. 


The purpose of Experiment 3 is: 

To compare our present method of 
teaching series of related individual les- 
sons or units by the blackboard and dis- 
cussion method, led by the teacher, with 
teaching them by the so-called unit 
assignment method. 

PREVIOUS RESEARCH

“A statement by a man prominent in the 
field of commercial education summarizes 
very well, indeed, the status of research in 
this field. The statement was made in 1926 
by Mr. E. W. Barnhart at a commercial edu- 
cation research conference held at the Uni- 
versity of Iowa. He said at that time: ‘What 
are the solved and the unsolved problems in 
commercial education today? Frankly, none 
appears to be solved.’ It is fair to say that 
this statement is as true today as when first
it was made.”⁶

The reason may be found in the following 
statement: “Studies of the professional 
preparation of business teachers in five states 
show that from twenty-nine to sixty-three 
per cent of teachers of business subjects hold 
baccalaureate degrees as compared with 
seventy-eight per cent of academic teachers 
who hold degrees. 

“This comparatively low showing on the 
part of the business teachers has not been 
entirely their fault, but has been due in part 
to limited opportunity for professional train-
ing and improvement.”⁷

Most of the research studies in commercial 
education have been: (1) Occupational sur- 
veys; (2) job analyses; and (3) school sur- 
veys. Few studies have dealt with methods 
of teaching commercial subjects and prac-
tically all of these have dealt with methods
of teaching shorthand, typewriting, and 
bookkeeping. No study has been found which
deals with methods of teaching junior busi-
ness training.

⁶ W. R. Odell, “Research in Commercial Education,”
Journal of Business Education, X (April, 1935), 11.

⁷ Benjamin R. Haynes and Jessie Graham, Research in
Business Education. Los Angeles: Claude C. Crawford,
University of Southern California, 1932. Pp. 232.


II 
EXPERIMENT NO. I 


The Problem 

What are the relative differences in effi- 
ciency of Method 1 (the blackboard and dis- 
cussion method), compared with Method 2 
(the lesson-sheet and textbook-reading dis- 
cussion method), when the methods are ap- 
plied to teaching individual lessons in junior 
business training? 


Procedure 

To solve the problem, the parallel-group 
or equivalent-group method of experimenta- 
tion is used and Method 1 is used with the 
control group. 


COLLECTION OF DATA


Selection of Teachers and Schools 

It was desirable but not possible to con- 
duct the experiment in evening high schools 
because each teacher has only one class. To 
overcome this disadvantage, it was decided 
that teachers of junior business training in 
both day junior high and evening high schools 
should be selected. Four of the five selected 
teachers had taught in both types of schools 
and had used both of the methods included 
in the experiment. 

The experiment was conducted in four 
junior high schools where it was possible to 
arrange with the principal and roster chair- 
man to assign each selected teacher two 
classes of the upper eighth grade. The schools 
selected were in various sections of Philadel- 
phia in order that pupils with possibly indi- 
vidual and community differences might be 
included in the experiment. 


Time and Duration of Experiment 

Both methods of teaching were applied 
daily by each of the five selected teachers in 
two different eighth grade classes during the 
Spring term of eighteen weeks from February 
to June, 1935.


Directions to Teachers

Each selected teacher was given: (1) A 
typewritten sheet of directions which dealt 
with the methods of teaching each individual 
lesson, as shown in the appendix, (2) a
mimeographed sheet for each class for re-
porting I.Q.’s, and results of seven junior
business training tests (one term and six unit
tests) given as pretests before teaching and
given again as review or end tests, etc., as
shown in the appendix, and (3) specially pre-
pared three-by-five cards for recording I.Q.,
results of each test, and other information
about each pupil, as shown in the appendix.


Data Provided by the Teachers 

Each teacher submitted to the writer: 
(1) A mimeographed sheet for the control 
class and one for the experimental class con- 
cerning I.Q.’s, results of tests, etc., and (2) 
a specially prepared three-by-five record card 
for each pupil in the experiment. A total of 
420 pupil-record cards were received from 
the five teachers conducting the experiment, 
as shown in Table I.


TABLE I
SOURCES OF DATA FOR EXPERIMENT No. 1

School             No.        No.        Pupils, Method 1     Pupils, Method 2
No.       Grade    Teachers   Classes    No.    Mean I.Q.     No.    Mean I.Q.
1         8B       1          2           44    110            36    111
2         8B       2          4           90    104            79    111
3         8B       1          2           41    107            42    109
4         8B       1          2           42    115            46    104

Total              5          10         217    108           203    109


TREATMENT OF DATA


Equating the Groups

After the 420 pupil-records were checked
for accuracy with the information contained
on the class-report sheets, certain pupil-
records were selected in order further to
equate the groups which had already been
equated as to grade, mean chronological age,
and teacher. This further equating of the
groups resulted in 320 pupil-records or
seventy-six per cent of the total group in the
experiment. Each pupil in the experimental
group was matched with another pupil in the
control group who had the:

(1) Same number of junior business train-
ing tests, both pretests and review or
end tests.

(2) Same standard score on the Philadel-
phia Reading Test within 1 point.

(3) Same I.Q. within five points. (The
Philadelphia Mental Ability Test was
used.)

In Table II are reported the means and
standard deviations for reading scores, I.Q.’s,
and pretests for the control group on which
was used Method 1 and for the experimental
group on which was used Method 2. From
these, it may be seen that the differences of
the means in each case are less than the
probable errors of the differences.


Summarizing the Results of the Junior Busi-
ness Training Tests

Since each junior business training test
dealt with knowledges, skills, and attitudes
that were separate, distinct, unrelated, and
independent of those in the other junior busi-
ness training tests, it was possible to obtain
a total test score for each pupil based on 356
items. Each of these items was answered
twice, once in a pretest and once again when
the same test was given as a review or end
test. However, since some of the pupils were
absent for a given test and matched with an-
other pupil also absent in the same test, there
were averages of 323 pretest items and 323
end test items answered by each pupil. In
other words, the average number of questions
answered was 646.

Distributions were prepared to show scores
in pretests. Similar and comparable distribu-
tions show scores for end tests. For each
pupil the difference between the sum of his
end test scores and the sum of his pretest
scores was obtained. This difference was
called the number of answers learned.

Since the number of answers learned was
dependent upon the number of questions not
answered in the pretests, it was necessary to
find what per cent of the wrong answers in
the pretests were answered correctly in the
end tests. This was found by the following
formula:

$$
\text{Per Cent of Possible Gain} = \frac{\text{Number of Answers Learned}}{\text{Total Number of Questions in Pretests} - \text{Total Score in Pretests}}
$$
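
The two measures defined above can be sketched in Python as follows. The function names and the sample pupil's scores are hypothetical and are introduced only for illustration. Multiplying by 100 to express the ratio as a per cent is an assumption; the printed formula does not show it, although the tabulated values are expressed in per cent.

```python
def answers_learned(pretest_score, end_test_score):
    """Number of answers learned: end-test total minus pretest total."""
    return end_test_score - pretest_score


def per_cent_of_possible_gain(pretest_score, end_test_score, total_questions):
    """Share of the items missed in the pretests that were answered
    correctly in the end tests, expressed in per cent."""
    possible_gain = total_questions - pretest_score
    if possible_gain <= 0:
        raise ValueError("no items were missed in the pretests")
    return 100.0 * answers_learned(pretest_score, end_test_score) / possible_gain


# Hypothetical pupil: 320 pretest items, 160 right at first, 230 at the end.
learned = answers_learned(160, 230)
gain = per_cent_of_possible_gain(160, 230, 320)
print(learned, gain)
```

Dividing by the number of items missed, rather than by the total number of items, keeps a pupil with a high pretest score from being penalized for having little left to learn.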





In order to discover what effect, if any, the 
different methods might have had for pupils 
with high and low reading scores, these data 
were studied for high-, low-, and middle- 
reading score groups as well as for the total 
group. 

FINDINGS 

Tables II to V summarize the results of 
Experiment No. 1. 

Table II shows that for the total group 
there is no significant difference in the means 
of the control and experimental groups either 
in the number of answers learned or in the 
per cent of possible gain. The control group 
or Method 1 showed a mean number of an- 
swers learned of 66.81, whereas the experi- 
mental group or Method 2 showed 67.81. 
This difference of 1.00 additional answer 
learned in favor of the experimental group is
not significant in view of the fact that the
probable error of the difference is 1.72 an- 
swers. The difference in mean per cent of 
possible gain is 1.66 per cent points in favor 
of the experimental group. Again, this is not 
a significant difference, since the probable 
error of the difference is 0.65 per cent points. 
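
The probable errors cited here can be checked with a short Python sketch. The article does not state how they were computed; the sketch assumes the conventional formulas, namely that the probable error of a mean is 0.6745 times the standard deviation divided by the square root of N, and that the probable error of a difference is the square root of the sum of the squared probable errors of the two means. The function names are hypothetical.

```python
import math


def probable_error_of_mean(sd, n):
    """Conventional probable error of a mean (assumed, not stated in the text)."""
    return 0.6745 * sd / math.sqrt(n)


def probable_error_of_difference(pe_a, pe_b):
    """Probable error of the difference between two independent means."""
    return math.hypot(pe_a, pe_b)


# Number of answers learned, Table II: S.D. 22.40 and 23.40, N = 160 each.
pe_control = probable_error_of_mean(22.40, 160)
pe_experimental = probable_error_of_mean(23.40, 160)
pe_difference = probable_error_of_difference(pe_control, pe_experimental)

# Ratio of the observed difference (1.00 answer) to its probable error.
ratio = 1.00 / pe_difference
print(round(pe_control, 2), round(pe_experimental, 2), ratio)
```

Under these assumptions the computed values agree closely with the 1.19, 1.25, and 1.72 reported for the number of answers learned, and the observed difference is smaller than its probable error.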


However, it should be noted that in both 
of these measures of the success of pupils, 
the differences, while small, are in favor of 
the experimental groups. 


Table III, which deals with the pupils in 
the middle fifty-two per cent of the reading 
score distribution, confirms the findings for 
the total group. It gives additional evidence 
of no significant differences in the means of 
the control and experimental groups in the 
number of answers learned or in the per cent 
of possible gain. 

It should be noted that the two measures of
the success of pupils point in opposite
directions: the difference of means of the
number of answers learned is 0.71 in favor of
the control group, whereas the difference of
means of the per cent of possible gain is 0.95
per cent points in favor of the experimental
group.


TABLE II
COMPARISON OF RESULTS FOR ALL PUPILS

                                    Method 1                Method 2
                                 Control Group             Exp. Group
                                    (N=160)                 (N=160)
                                                P.E.                    P.E.       Dif. of P.E. Dif.
                                Mean    S.D.   Means    Mean    S.D.   Means         Means  of Means
Phila. Reading Test
  Standard Score                12.0    3.17    1.69    12.0    3.20    1.70           0.0      2.41
Phila. Mental Ability Test
  I.Q.                         108.9    11.1    0.59   108.9    23.3    0.59           0.0      0.84
Pretests                      167.38   33.10    1.77  168.75   35.10    1.87      (X) 1.37      2.57
End Tests                     234.19   40.70    2.17  236.56   48.50    2.58      (X) 2.37      3.37
Number of Answers
  Learned                      66.81   22.40    1.19   67.81   23.40    1.25      (X) 1.00      1.72
Per Cent of Possible Gain      43.40   12.10     .64   45.06   12.20     .65      (X) 1.66       .65


TABLE III
COMPARISON OF RESULTS OF PUPILS IN THE MIDDLE 52 PER CENT OF THE
READING SCORE DISTRIBUTION

                                    Method 1                Method 2
                                 Control Group             Exp. Group
                                     (N=84)                  (N=84)
                                                P.E.                    P.E.       Dif. of P.E. Dif.
                                Mean    S.D.   Means    Mean    S.D.   Means         Means  of Means
Pretests                      163.69   28.60    2.10  170.95   34.40    2.53      (X) 7.26      3.29
End Tests                     231.90   37.60    2.77  238.45   48.30    3.55      (X) 6.55      4.50
Number of Answers
  Learned                      68.21   22.50    1.65   67.50   22.00    1.62       (C) .71      2.30
Per Cent of Possible Gain      43.22   11.40     .84   44.17   13.30     .98       (X) .95      1.29

Table IV shows that, for the pupils in the 
upper twenty-six per cent of the reading score 
distribution, there is no significant difference 
in the means of the control and experimental 
groups either in the number of answers 
learned or in the per cent of possible gain. 

Both measures of the success of pupils favor
the experimental group. The difference in
mean number of answers learned (7.03) is
approximately twice the probable error of the
difference of the means (3.54). The difference
in mean per cent of possible gain (1.70 per
cent points) is greater than the probable
error of the difference of the means (1.34).

Table V shows that for the pupils in the 
lower twenty-two per cent of the reading 
score distribution there is no significant dif- 
ference either in the mean number of answers 
learned or in the mean per cent of possible 
gain. However, it should be noted that the
difference in both measures favors the control
group.


SUMMARY OF SIGNIFICANT FINDINGS


Figure 1 shows the percentile distributions
of answers learned expressed as per cents of
possible gain. It confirms the findings of the
preceding tables. For instance, Figure 1
shows slightly higher results in Method 2 for
pupils in the upper part of the distribution
of percentage of possible gain. This confirms
the findings of Table IV, which shows that
the pupils with the higher reading scores had
higher results with Method 2. While the
evidence is not conclusive on this point,
there is some reason for thinking that pupils
at the upper end of the reading score
distribution and those at the upper end of the
distribution of percentage of possible gain
are more successful with Method 2. To some
degree these are the same pupils.


Figure 1 shows slightly higher results for
the lower part of the distribution of
percentages of possible gain in Method 1. This
confirms the data of Table V, which shows that
pupils in the lower end of the reading score
distribution are more successful with
Method 1.


TABLE IV
COMPARISON OF RESULTS OF PUPILS IN THE UPPER 26 PER CENT OF THE
READING SCORE DISTRIBUTION

                                    Method 1                Method 2
                                 Control Group             Exp. Group
                                     (N=41)                  (N=41)
                                                P.E.                    P.E.       Dif. of P.E. Dif.
                                Mean    S.D.   Means    Mean    S.D.   Means         Means  of Means
Pretests                      189.10   37.30    3.93  180.85   31.30    3.30      (C) 8.25      5.11
End Tests                     252.56   46.90    4.94  251.34   47.80    5.04      (C) 1.22      7.06
Number of Answers
  Learned                      63.46   23.50    2.48   70.49   23.90    2.52      (X) 7.03      3.54
Per Cent of Possible Gain      47.93   12.70    1.34   49.63   12.70    1.34      (X) 1.70      1.34


TABLE V
COMPARISON OF RESULTS OF PUPILS IN THE LOWER 22 PER CENT OF THE
READING SCORE DISTRIBUTION

                                    Method 1                Method 2
                                 Control Group             Exp. Group
                                     (N=35)                  (N=35)
                                                P.E.                    P.E.       Dif. of P.E. Dif.
                                Mean    S.D.   Means    Mean    S.D.   Means         Means  of Means
Pretests                      150.71   23.50    2.68  149.86   33.20    3.78       (C) .85      4.63
End Tests                     218.14   30.90    3.52  214.71   41.70    4.76      (C) 3.43      5.92
Number of Answers
  Learned                      67.43   20.40    3.32   64.85   22.20    2.53      (C) 2.58      4.17
Per Cent of Possible Gain      38.71   11.20    1.28   38.43   11.70    1.33       (C) .28      1.84


Fig. 1. Experiment No. 1. Percentile distributions of answers learned
expressed as per cents of possible gain. (Legend: Method 1, blackboard
and discussion method.)


III 
EXPERIMENT No. 2 


The Problem 

What are the relative differences in effi- 
ciency of Method 1 (the blackboard and dis- 
cussion method), compared with Method 3 
(the lesson-sheet and textbook-reading dis- 
cussion method) when the methods are ap- 
plied to teaching individual lessons in junior 
business training? 


Procedure 

The procedure is the same as in Experi- 
ment No. 1, except where differences are 
noted. 


COLLECTION OF DATA


Selection of Teachers and Schools 

Three of the five selected teachers had 
taught junior business training in both day 
and evening high schools and all of the 
teachers had experience with both of the 
methods of teaching in this experiment. 


Directions to Teachers 

The typewritten sheet of directions which 
dealt with the methods of teaching each in- 
dividual lesson was similar to that shown in 
the appendix, except that the procedure for 
Method 3 was used instead of Method 2. 
Method 3 differed from Method 1 only in
that the class discussion in the
problem-solving division was preceded by
textbook reading in the classroom.


Data Provided by the Teachers 


A total of 396 pupil-record cards were re- 
ceived from the five teachers, as shown in 
Table VI. 


TREATMENT OF DATA 


Equating the Groups 

Further equating of the groups resulted in 
250 pupil-records or sixty-three per cent of 
the total group in the experiment. Each pupil 
in the experimental group was matched with 
another pupil in the control group who had 
the: (1) same number of junior business 
training tests, both pretests and end tests; 
and (2) same I.Q. within two points.†
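The matching rule just described can be sketched as a simple one-to-one pairing. The source gives only the two criteria (same number of tests, I.Q. within two points). The greedy closest-I.Q. rule, the class and function names, and the sample pupils below are assumptions made for illustration and are not the author's procedure.

```python
from dataclasses import dataclass


@dataclass
class Pupil:
    name: str
    tests_taken: int  # number of tests, pretests and end tests together
    iq: int


def match_pairs(control, experimental, iq_tolerance=2):
    """Pair each experimental pupil with at most one unused control pupil."""
    unused = list(control)
    pairs = []
    for x in experimental:
        candidates = [
            c for c in unused
            if c.tests_taken == x.tests_taken and abs(c.iq - x.iq) <= iq_tolerance
        ]
        if candidates:
            # Assumed tie-break: take the control pupil with the closest I.Q.
            best = min(candidates, key=lambda c: abs(c.iq - x.iq))
            unused.remove(best)
            pairs.append((best, x))
    return pairs


# Hypothetical pupils (not data from the study).
control = [Pupil("c1", 12, 100), Pupil("c2", 12, 110), Pupil("c3", 10, 101)]
experimental = [Pupil("x1", 12, 101), Pupil("x2", 12, 105), Pupil("x3", 10, 104)]
pairs = match_pairs(control, experimental)
print([(c.name, x.name) for c, x in pairs])  # [('c1', 'x1')]
```

Pupils for whom no partner satisfies both criteria are dropped, which is consistent with the matched groups retaining only part of the original pupil-records.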


TABLE VI
SOURCES OF DATA FOR EXPERIMENT No. 2

School            No.       No.      Pupils, Method 1     Pupils, Method 3
No.     Grade   Teachers  Classes    No.    Mean I.Q.     No.    Mean I.Q.
1        8B        1         2        49       100         51       100
                             1        23       100         --        --
2        8B        2         2        40       108         32       104
                             2        35        93         40       114
3        8B        1         1        --        --         40       102
4        8B        1         2        45       108         41       108
Total              5        10       192       102        204       105

* This teacher was assigned three 8B classes. In two of the
classes he conducted Experiment No. 1 and in the third class
he conducted Experiment No. 2. Otherwise, Experiment
No. 2 was conducted in schools different from those in
Experiment No. 1.

† In Experiments 2 and 3, the I.Q. was used because
higher correlation was found to exist between I.Q. and per
cent of possible gain than between reading score and per cent
of possible gain. The correlation coefficients are as follows
for the control and experimental groups in Experiment No. 1:

                                          Control      Experimental
Between reading score and per cent
  of possible gain                       .37 ± .05       .36 ± .05
Between I.Q. and per cent of
  possible gain                          .50 ± .04       .57 ± .04


From Table VII it may be seen that each
group was equal as to mean I.Q. and that the
difference of the mean of the pretests,
although it was 3.64 answers in favor of the
experimental group, was not significant. The
difference was only slightly larger than the
probable error of the difference (3.03).


Summary of the Results of the Junior Business
Training Tests

The average number of questions answered
by each of the 250 pupils was 666 questions,
of which 333 were on pretests and 333 on end
tests.


FINDINGS


Tables VII to X summarize the results of
Experiment No. 2.

Table VII shows that for the total group
there is no significant difference in the means
of the control and experimental groups either
in the number of answers learned or in the
per cent of possible gain. The mean number
of answers learned by the control group or
Method 1 was 79.84 and by the experimental
group or Method 3 was 74.28. This difference
of 5.56 additional answers learned favors
the control group, but is not significant in
view of the fact that the probable error of
the difference is 2.21 answers. On the other
hand, the difference in mean per cent of
possible gain is 1.60 per cent points in favor of
the experimental group, but this is not a
significant difference, since the probable error
of the difference is 1.28 per cent points.


TABLE VII
COMPARISON OF RESULTS FOR ALL PUPILS

                                    Method 1                Method 3
                                 Control Group             Exp. Group
                                    (N=125)                 (N=125)
                                                P.E.                    P.E.       Dif. of P.E. Dif.
                                Mean    S.D.   Means    Mean    S.D.   Means         Means  of Means
Phila. Mental Ability Test
  I.Q.                         104.3   10.65    0.64   104.3   10.65    0.64           0.0      0.95
Pretests                      165.80   36.60    2.21  169.44   34.50    2.08      (X) 3.64      3.03
End Tests                     245.64   49.50    2.99  243.72   44.30    2.67      (C) 1.92      4.01
Number of Answers
  Learned                      79.84   26.90    1.63   74.28   25.00    1.50      (C) 5.56      2.21
Per Cent of Possible Gain      44.52   15.20     .92   46.12   14.90     .90      (X) 1.60      1.28

Table VIII, which deals with the pupils in 
the middle fifty per cent of the I.Q. distribu- 
tion, confirms the findings for the total group. 
It gives additional evidence of the difference 
in mean number of answers learned (1.91) 
favoring very slightly the control group, 
whereas the difference in mean per cent of 
possible gain (2.54) favors very slightly the 
experimental group, but in neither case is the 
difference significant.

Table IX, which deals with the pupils in
the upper fifteen per cent of the I.Q.
distribution, further confirms the findings for the
total and middle groups. It should be noted,
however, that the difference of 14.21
additional answers learned favors the control
group and is more than three times the prob- 
able error of the difference of the means 
(4.38). On the other hand, the difference in 
mean per cent of possible gain is 5.32 per 
cent points in favor of the experimental 
group, but this is not a significant difference, 
since the probable error is 3.11 per cent 
points. 

Table X shows that for the pupils in the
lower thirty-five per cent of the I.Q.
distribution there is no significant difference.
However, it should be noted that in both of the
measures of the success of pupils, the
differences, while small (0.45 additional answers
learned and 2.55 additional per cent points
of possible gain), are in favor of the
experimental groups.


TABLE VIII
COMPARISON OF RESULTS OF PUPILS IN THE MIDDLE 50 PER CENT OF THE I.Q. DISTRIBUTION

                                    Method 1                Method 3
                                 Control Group             Exp. Group
                                     (N=63)                  (N=63)
                                                P.E.                    P.E.       Dif. of P.E. Dif.
                                Mean    S.D.   Means    Mean    S.D.   Means         Means  of Means
Pretests                      173.73   34.20    2.90  171.35   27.90    2.37      (C) 2.38      3.74
End Tests                     256.59   45.50    3.86  252.30   39.80    3.38      (C) 4.29      5.42
Number of Answers
  Learned                      82.86   29.90    2.54   80.95   26.40    2.24      (C) 1.91      3.38
Per Cent of Possible Gain      45.64   15.70    1.33   48.18   14.70    1.25      (X) 2.54      1.82


TABLE IX
COMPARISON OF RESULTS OF PUPILS IN THE UPPER 15 PER CENT OF THE I.Q. DISTRIBUTION

                                    Method 1                Method 3
                                 Control Group             Exp. Group
                                     (N=19)                  (N=19)
                                                P.E.                    P.E.       Dif. of P.E. Dif.
                                Mean    S.D.   Means    Mean    S.D.   Means         Means  of Means
Pretests                      186.05   39.10    6.04  204.47   29.90    4.63     (X) 18.42      7.61
End Tests                     275.53   49.10    7.59  279.74   35.30    5.46      (X) 4.21      9.35
Number of Answers
  Learned                      89.48   20.30    3.14   75.27   19.80    3.06     (C) 14.21      4.38
Per Cent of Possible Gain      46.00   14.50    2.24   51.32   14.00    2.16      (X) 5.32      3.11


TABLE X
COMPARISON OF RESULTS OF PUPILS IN THE LOWER 35 PER CENT OF THE I.Q. DISTRIBUTION

                                    Method 1                Method 3
                                 Control Group             Exp. Group
                                     (N=43)                  (N=43)
                                                P.E.                    P.E.       Dif. of P.E. Dif.
                                Mean    S.D.   Means    Mean    S.D.   Means         Means  of Means
Pretests                      145.23   25.90    2.66  148.26   30.50    3.14      (X) 3.03      4.11
End Tests                     216.40   39.90    4.10  219.88   41.20    4.24      (X) 3.48      5.90
Number of Answers
  Learned                      71.17   [...]    2.58   71.62   23.90    2.46      (X) 0.45      3.56
Per Cent of Possible Gain      38.26   11.20    1.15   40.81   13.70    1.41      (X) 2.55      1.82


SUMMARY OF SIGNIFICANT FINDINGS


Figure 2, which shows the percentile
distributions of answers learned expressed as
percentage of possible gain, confirms the
general findings of the preceding tables. For
instance, Figure 2 shows that for the total
group neither method is definitely better. It
shows that Method 1 is slightly better for the
pupils in the upper part of the distribution
of percentage of possible gain, whereas
Method 3 is slightly better for most of the
pupils in the lower part of the distribution
of percentage of possible gain.

For pupils in the upper part of the
distribution of percentage of possible gain,
Method 1 is more successful. This is in
contradiction to the findings in Table IX,
which shows that pupils in the upper part
of the I.Q. distribution were more successful
with Method 3. This is evidence of the
unreliability of definite conclusions as to the
superiority of either method for the upper
I.Q.-upper achievement group.


Fig. 2. Experiment No. 2. Percentile distributions of answers learned
expressed as per cents of possible gain. (Horizontal axis: percentiles,
10% to 100%. Legend: Method 1, blackboard and discussion method;
Method 3, blackboard and textbook-reading discussion method.)


IV 
EXPERIMENT NO. 3 


The Problem 

What are the relative differences in effi- 
ciency of Method 1 (the blackboard and dis- 
cussion method), compared with Method 4 
(the unit assignment method), when the 
methods are applied to teaching the units or 
topics in junior business training?


Procedure

The procedure is the same as in Experiment
No. 1, except where differences are noted.


COLLECTION OF DATA


Selection of Teachers and Schools

One of the two teachers in the experiment
had taught junior business training in both
day junior and evening high schools.


Time and Duration of Experiment

Before conducting this experiment, a
method somewhat similar to Method 4 had
been tried by ten teachers in the evening high
schools. In this method the pupils were
given lesson-sheets to work on at their own
rate of speed and without any time limit for
each unit. This method proved unsatisfactory
because in a relatively short time each pupil
in the class was working on a different
lesson-sheet so that class or group instruction
was impossible and individual instruction was
not feasible. Hence, it was decided that in this
experiment a time limit should be set for the
study of each unit. The number of days
allotted for independent work on the unit
was to be equal to the number of individual
lessons in the unit. In this way, the time
allotment for the study of each unit was the
same for each method in this experiment.


Data Provided by the Teachers

A total of 264 pupil-record cards were
received from the two teachers, as shown in
Table XI.


TABLE XI
SOURCES OF DATA FOR EXPERIMENT No. 3

School                 No.       No.      Pupils, Method 1     Pupils, Method 4
No.     Grade        Teachers  Classes    No.    Mean I.Q.     No.    Mean I.Q.
1       [illegible]     1         3        32       105         36       102
                                           --        --         36       103
2       [illegible]     1         4        40       104         42       102
                                           33       109         45        99
Total                   2         7       105       106        159       101


TREATMENT OF DATA


In Experiment No. 3 some of the pupils
were of the low ninth (9A) grade. Their
median age was approximately 14 years,
5 months. The remainder were of the high
ninth (9B) grade. Their median age was
approximately 14 years, 10 months. The 9A
units or topics were the Intelligent Buyer,
Business Procedure in Buying, Successful
Selling, Selling in Business, Transportation of
People, and Transportation of Goods by
Mail. The 9B units or topics were Money
and Credit, Thrift, Budgets and Records,
Investments, Insurance, and Bank Services.


Equating the Groups 

Further equating of the groups resulted in 
184 pupil-records or seventy per cent of the 
total group in the experiment. Each pupil in 
the experimental group was matched with 
another pupil in the control group who had 
the: (1) same number of junior business 
training tests, both pretests and end tests; 
(2) same grade; and (3) same I.Q. within 
three points. 

From Table XII it may be seen that each
group was equal as to mean I.Q. and that the
difference of the mean of the pretests,
although it was 7.39 questions in favor of the
control group, was not significant. The
difference was less than twice as large as the
probable error of the difference (3.78).

* In school No. [illegible] 3 [illegible] happened to be
conducted; otherwise, each experiment was conducted in a
different school. This Experiment No. 3 was conducted by
[illegible] and with pupils who were not in Experiments No. 1
[illegible].


Summary of the Results of the Junior Busi- 
ness Training Tests 
The average number of questions answered 
by each of the 184 pupils was 694 questions, 
of which half were on the pretests and half 
on the end tests. 


FINDINGS 


Tables XII to XV summarize the results 
of Experiment No. 3. 

Table XII shows that for the total group 
there is no significant difference in the means 
of the control and experimental groups either 
in the number of answers learned or in the 
per cent of possible gain. The mean number 
of answers learned by the control group or 
Method 1 was 66.73 and by the experimental 
group or Method 4 was 70.00. This differ- 
ence of 3.27 additional answers learned favors 
the experimental group, but is not significant, 
since the probable error of the difference is 
2.37 answers. The difference in mean
percentage of possible gain is 1.63 per cent
points in favor of the experimental group.
Again, this is not a significant difference, since
the probable error of the difference is 1.40
per cent points.

It should be noted, however, that in both
of these measures of the success of pupils, the
differences, while small, are in favor of the
experimental groups.

Table XIII deals with the pupils in the
middle fifty-seven per cent of the I.Q.
distribution. It confirms the findings for the
total group in that it shows the differences in
the mean number of answers learned (4.62)
and the mean per cent of possible gain (1.54)
are not significant, although both differences
favor slightly the control groups.

Table XIV concerns the pupils in the upper
sixteen per cent of the I.Q. distribution. It
shows the difference in mean number of
answers learned to be 0.67 answers in favor of
the experimental group and the difference in
mean per cent of possible gain to be 2.00 per
cent points in favor of the same group, but
in neither case is the difference significant
according to the probable error of the difference.

Table XV deals with the pupils in the
lower twenty-seven per cent of the I.Q.
distribution. It shows a difference of 18.00
answers in the mean number of answers
learned in favor of the experimental group
and the difference is practically four times
as great as the probable error of the
difference (4.78). It also shows 8.40 per cent
points difference in the mean percentage of
possible gain in favor of the experimental
group, and the difference is nearly three times
as great as the probable error of the
difference (2.88).


TABLE XII
COMPARISON OF RESULTS FOR ALL PUPILS

                                    Method 1                Method 4
                                 Control Group             Exp. Group
                                     (N=92)                  (N=92)
                                                P.E.                    P.E.       Dif. of P.E. Dif.
                                Mean    S.D.   Means    Mean    S.D.   Means         Means  of Means
Phila. Mental Ability Test
  I.Q.                         105.4    9.17    0.64   105.4    9.55    0.67           0.0      0.92
Pretests                      203.48   38.20    2.68  196.09   39.60    2.67      (C) 7.39      3.78
End Tests                     270.21   46.90    3.30  266.09   57.90    4.07      (C) 4.12      5.24
Number of Answers
  Learned                      66.73   18.50    1.30   70.00   28.20    1.98      (X) 3.27      2.37
Per Cent of Possible Gain      43.15   12.20     .86   44.78   15.80    1.11      (X) 1.63      1.40


TABLE XIII
COMPARISON OF RESULTS OF PUPILS IN THE MIDDLE 57 PER CENT OF THE I.Q. DISTRIBUTION

                                    Method 1                Method 4
                                 Control Group             Exp. Group
                                     (N=52)                  (N=52)
                                                P.E.                    P.E.       Dif. of P.E. Dif.
                                Mean    S.D.   Means    Mean    S.D.   Means         Means  of Means
Pretests                      203.27   35.40    3.31  190.19   40.50    3.79     (C) 13.08      5.03
End Tests                     272.89   39.90    3.73  255.19   60.70    5.68     (C) 17.70      6.79
Number of Answers
  Learned                      69.62   16.80    1.57   65.00   27.90    2.61      (C) 4.62      3.04
Per Cent of Possible Gain      44.62    9.60     .90   43.08   15.40    1.44      (C) 1.54      1.70


TABLE XIV
COMPARISON OF RESULTS OF PUPILS IN THE UPPER 16 PER CENT OF THE I.Q. DISTRIBUTION

                                    Method 1                Method 4
                                 Control Group             Exp. Group
                                     (N=15)                  (N=15)
                                                P.E.                    P.E.       Dif. of P.E. Dif.
                                Mean    S.D.   Means    Mean    S.D.   Means         Means  of Means
Pretests                      220.33   31.40    5.47  216.33   31.80    5.54      (C) 4.00      7.40
End Tests                     298.33   29.60    5.16  295.00   46.70    8.14      (C) 3.33      9.64
Number of Answers
  Learned                      78.00   10.90    1.90   78.67   27.50    4.80       (X) .67      5.16
Per Cent of Possible Gain      50.33   10.90    1.89   52.33   14.80    2.58      (X) 2.00      3.20


TABLE XV
COMPARISON OF RESULTS OF PUPILS IN THE LOWER 27 PER CENT OF THE I.Q. DISTRIBUTION

                                    Method 1                Method 4
                                 Control Group             Exp. Group
                                     (N=25)                  (N=25)
                                                P.E.                    P.E.       Dif. of P.E. Dif.
                                Mean    S.D.   Means    Mean    S.D.   Means         Means  of Means
Pretests                      193.80   43.50    5.87  196.20   37.80    5.10      (X) 2.40      7.78
End Tests                     247.80   57.10    7.70  268.20   51.50    6.95     (X) 20.40     10.37
Number of Answers
  Learned                      54.00   22.70    3.06   72.00   27.30    3.68     (X) 18.00      4.78
Per Cent of Possible Gain      35.40   14.10    1.90   43.80   16.10    2.17      (X) 8.40      2.88


SUMMARY OF SIGNIFICANT FINDINGS 


Figure 3 shows the percentile distributions 
of answers learned expressed as percentage of 
possible gain. These distributions confirm 
the findings shown in Tables XII, XIII, and 
XIV for the total, middle, and upper I.Q. 
groups, but conflict with the findings for the 
lower I.Q. groups (Table XV). Both Figure 
3 and Table XIV show slight advantages of 
Method 4 for the upper part of the distribu- 
tion of percentage of possible gain and the 
upper part of the I.Q. distribution,
respectively. However, Figure 3 shows a slight
advantage of Method 1 for the lower part of
the distribution of percentage of possible gain
in contrast to a slight advantage of Method 4
for the lower part of the I.Q. distribution
(Table XV). This is evidence of the
unreliability of definite conclusions as to the
superiority of either method for the lower
I.Q.-lower achievement group.


V 
GENERAL SIGNIFICANCE OF THE FINDINGS 


In a series of three experiments, three 
varying methods of teaching junior business 
training were compared with one control 
method. Parallel or equivalent groups of 
pupils were used to determine the superiority, 
if any, of any one of the four methods. Re- 
sults were studied for each of the total groups 
and for subdivisions of the total groups ac- 
cording to their relative achievement, whether 
high, average, or low. 


In no case has clear-cut evidence been 
found as to the superiority of any one method 
at any point in the achievement range. Ac- 
cordingly, the selection of the method to be 
used should depend upon other factors than 
the ability and achievement of the pupils. 
Such factors are: 

(1) available equipment, including
lesson-sheets.

(2) the teacher’s training and personality.

(3) the learning habits of the pupils.

However, there is slight evidence that the
high ability pupils profit by the greater free- 
dom offered them with prepared lesson-sheets 
(Methods 2 and 4). Pupils in the low abil- 
ity groups do not seem to profit by the use 
of lesson-sheets. However, the evidence on 
this point is conflicting in Experiment No. 3 
where the lesson-sheets were used for com- 
plete unit assignments. 

The factor of reading in class was contrasted
with assigned reading at home in Experiment
No. 2. Reading in class seems to
favor the low ability groups, while use of
class time for other purposes seems to favor 
the high ability groups. On this last point, 
the evidence is again conflicting, depending 
on how the high ability group is defined. 
Thus, the selection of the high ability group 
as the upper fifteen per cent of the I.Q. dis- 
tribution shows an advantage for textbook 
reading for this group (Table IX). On the 
other hand, the selection of the upper fifteen 
per cent of the group according to percentage 
of possible gain shows an advantage for the
method not requiring textbook reading in
class (Figure 2). Discrepancies such as these 
may be due to the relatively small numbers 
of pupils involved when the total groups were 
subdivided into high-, middle-, and low- 
ability groups. 

To determine the relative success of the 
various methods, it would be desirable to 
study the achievement of pupils actually 
segregated into ability groups large enough to 
furnish results which would be reliable.


Appendix


TYPEWRITTEN SHEET OF DIRECTIONS FOR EACH TEACHER CONCERNING METHODS OF TEACHING 
IN EXPERIMENT No. 1 





Directions for Comparing
Method 1, the blackboard and discussion method, with
Method 2, the lesson-sheet and textbook-reading discussion method


Procedure

I. Self-reliance work:
This part of the lesson should be the same for each class as to content, methods, and
time-allotment (13 minutes) except as follows:
    Method 1: Method 1 class (the control group) will copy the work from the
        blackboard.
    Method 2: Method 2 class (the experimental group) will copy the work from the
        mimeographed lesson-sheet.

II. Problem:
This part of the lesson should be the same for each class as to content, methods, and
time-allotment (13 minutes) except as follows:
Problem of the lesson to be solved by:
    Method 1:
        1st—Class discussion led by teacher.
        2nd—Development of an outline, diagram, or model business form by
            the teacher at the blackboard.
    Method 2:
        1st—Silent textbook reading of pages related to lesson-problem.
            Maximum time allowed for reading to be 5 minutes.
        2nd—Class discussion led by teacher.
        3rd—Filling in incomplete statements on lesson-sheet.

III. Practice work:
This part of the lesson should be the same for each class as to content, methods, and
time-allotment (13 minutes) except as follows:
    Method 1: Method 1 class will copy practice work from blackboard.
    Method 2: Method 2 class will copy practice work from lesson-sheet.

IV. Homework:
This should be the same for each class. Pupils in Method 2 class will keep the
lesson-sheets in their notebooks.


Mimeographed Sheet for Reporting Class Results in Experiment No. 1


SCHOOL DISTRICT OF PHILADELPHIA, DEPARTMENT OF SUPERINTENDENCE
DIVISION OF COMMERCIAL EDUCATION

REPORT OF PUPILS’ ACHIEVEMENT ON TESTS (1)

CODE:
*R = Pupil's raw score for review test
P = Pretest mark
A = Answers Learned (R — P = A)
**—Encircle numbers indicating boys. Write last name only.

[Form blanks for each test: Test No., (Name of Test), (Time)]

(Time allowed should be the same for each pretest and review test; i. e., 20
min. for term test and 15 min. for topic tests.)

















(1) Term pretest and review test contained 56 questions: Nos. 1-50 and 95-100 of Term Pretest and Review Pretest No. 1 of Introduction to Business
Pretests and Review Tests by Kirk, Buckley, and Waesche (Published by The John C. Winston Co., Phila.)

Tests No. 2, 3, 4, 5, and 6 respectively were unit or topic tests of fifty questions each on the Business Letter; The Telephone; Telegrams,
Cablegrams, and Radiograms; Getting Along with People; Sources of Business Information; and Filing and Finding Information. They are called
Topic Pretests and Review Tests because they are used as both pretests and as review or end tests.


REPORT CARD OF INDIVIDUAL PUPIL’S RECORD IN EXPERIMENT No. 1

L.N.—Last name; M or F—male or female; Age (Y) years (M) months; C or X—in
control or experimental class; 1, 2, 3—in Experiment No. 1, 2, or 3.

P.Oc.—Parent’s occupation; F or M—father or mother; R.S.*—reading score; I.Q.**— intel- 
ligence quotient. 

Term: R = term review test, number of answers correct; P = term pretest, number of an- 
swers correct; A — achievement or Number of Answers Learned for term test, shown as 
the difference between the pretest score and the review test score; A.P. = number of An- 
swers Possible to learn; Ab—number of days absent during the term. 

B.L.—Business letter test results: R—review test; P—pretest; A—answers learned or dif- 
ference between R and P. 

Tp.—Telephone topic tests results: R—review test; P—pretest; A—answers learned or dif- 
ference between R and P 

Tg.—Telegraph topic tests results: R—review test; P—pretest; A—answers learned or dif- 
ference between R and P 

C.P.—Communicating in person topic results: R—review test; P—pretest; A—answers 
learned or difference between R and P. 

B.I.—Business information topic results: R—review test; P—pretest; A—answers learned 
or difference between R and P 

F.—Filing and Finding information results: R—review test; P—pretest; A—answers 
learned or difference between R and P. 

Report marks: (1) for 1st report period, at end of 6 weeks. 
(2) for 2nd report period, at end of 12 weeks. 
(3) for 3rd report period, at end of 18 weeks. 


School (initial of school to show from which school the card came). 
Repeater? _____- Yes or No, as to whether or not pupil is repeating the grade. 
A ÷ A.P. = Per Cent of Possible Gain.


* R.S.—reading score obtained by teacher after using Directions
for Administration and Scoring of Tests issued by Division of
Educational Research and Results, Philadelphia, Pa., [illegible]
Philadelphia Silent Reading Test for Comprehension and Rate.

** I.Q.—intelligence quotient obtained by teacher from
results of Philadelphia Mental Ability Test for 8B issued by
Division of Educational Research and Results.








